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RECOMBINATION OF POLYNUCLEOTIDE SEQUENCES 
USING RANDOM OR DEFINED PRIMERS 

The U.S. Government has certain rights in this invention pursuant to 
Grant No. DE-FG02-93-CH 10578 awarded by the Department of Energy- ajid 
Grant No. NOOO 14-96- 1-0340 awarded by the Office of Naval Research. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates generally to in vitro methods for 
mutagenesis and recombination of polynucleotide sequences. More 
particularly, the present invention involves a simple and efficient method for in 
intro mutagenesis and recombma'Jon of polynucleotide sequences based on 
pol>-merasc-cataJ>^ed extension of pnmer oligonucleotides, followed by gene 
assembly and optional gene amplifica'Jon. 

2. D esc notion of Related Ar: 

The publications and otr.c: reference materials referred to herein to 
describe the background of the invention and to provide additional detail 
re[;arding its practice arc hereby incorporated by reference. For convenience, 
the reference materials arc numcr.caily referenced and grouped in the 
appended bibliography. 

Pro :e ins are engineered -.vi:h the goal of improving their performance 
for practical applications. Desirable properties depend on the application of 
interest and may include tighter binding to a receptor, high catalytic activity, 
high stability-, the ability to accept a wider (or narrower) range of substrates, or 
Lhe ability to function in nonnatural environments such as organic solvents. A 
variety- of approaches, including 'rational' design and random mutagenesis 
methods, have been successfully used to optimize protein functions (1). The 
choice of approach for a given opl:m'^ation problem will depend upon the 
degree of understanding of the rela-or.ships between sequence, structure and 
function. The rational redesign c: nr. cr.r.-me catalrac site, for example, often 
'requires extensive knowlcdce of tr.c cnr^^Tne structure, tine sO^icturcs of its 
comnlcxcs u-ith various ligands and r.^nalogs of reaction intermediates and 
details of the catalytic mechanism. Such information is available only for a 
vcr>' few well-studied systems; little is kno%vn about the vast majority of 
potentially interesting cnr^mes. Identify-ing the amino acids responsible for 
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cxisting protein functions and those which might give nse to new funcuons 
remains an often-overwhelming challenge. This, together with the growing 
appreciation that many protein functions are not confined to a small number 
of amino acids, but are affected by residues far from active sites, has 

. prompted a growing number of groups to turn to random mutagenesis, or 
•directed' evolution, to engineer novel proteins (1). 

Various optimization procedures such as genetic algorithms (2,3) and 

•. evolutionary strategies (4.5) have been inspired by natural evolution. These 
procedures employ muution. which makes small random changes in 
members of the population, as well as crossover, which combines properties of 
different individuals, to achieve a specific optimization goal. There also e.xist 
strong interplays between mutation ajid' crossover, as shown by computer 
simulations of different optimization problems (6-9). Developing efficien: and 
practical experimental techniques to mimic these key processes is a scientific 
chaUenge. The application of such techniques should allow one, for example, 
to explore and optimize the functions of biological molecules such as proteins 
and nucleic acids, in vivo or even completely free from Lhe constraints of a 

living system (10,1 1). 

Directed evolution, inspired by natural evolution, involves the 
generation and selection or scrc:ni.".g of a pool of mutated molecules which 
has sufficient diversity for a molecule encoding a protein with altered or 
cnharxed function to be prcscr.- therein. It generally begins with creation of a 
libra.-/ of mutated genes. Gene products which show improvement with 
respect to the desired properr,- or set of propeoies arc identified by selection 
or screening. The genels) encoding those products can be subjected to further 
cycles of the process in order to accumulate beneficial mutations. This 
evolution can involve few or many generations, depending on how far one 
wishes to progress and the effects of mutations typically observed in each 
generation. Such approaches have been used to create novel functional 
nucleic acids (12), peptides and other small molecules (12). antibodies (12). as 
weU as enzymes and other proteins (13,14,16). Directed evolution requires 
Uttle specific knowledge about tine product itself, only a means to evaluate the 
function to be optimized. These procedures arc even fairly tolerant to 
•inaccuracies and noise in the func'^Dn evaluation (15). 

The diversit>' of genes for directed evolution can be created by 

■ • r^ni-i' -int?tio--s usin" a variety of methods, including 

introqucmg new pomi. muiauo..^ ua^aio a. j 

muugenic PGR (15) or combinatorial cassette mutagenesis (16). The ability to 
recombine genes, however, ca.n add an unportant dimension to the 
evolutionary process, as evidenced by its key role in natural evoluuon. 
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Homologous recombination is an important natural process in which 
organisms exchange genetic information between related genes, increasing the 
accessible genetic diversity within a species. While introducing potentially 
powerful adaptive and diversification competencies into their hosts, such 
pathways also operate at very low efTiciencics, often eliciting insignificant 
changes in pathway structure or funcdon, even after tens of generations. 
Thus, while such mechanisms prove beneficial to host organisms/species over 
geological time spans, in lAuo rccombinadon methods represent cumbersome, 
if not unusable, combmatoriai processes for tailoring the performance of 
enzymes or other proteins not strongly Unked to the organism's intermediary 
metabolism and survival. 

Several groups have recognized the utility of gene recombination in 
directed evolution. Methods for in uiuo recombination of genes are disclosed, 
for example, in published PCT applicadon V/0 97/07205 and US Pal. No. 
5,093,257, As discussed above, these in uiuo methods are cumbersome and 
poorly opdmLzed for rapid evoluuon of funcdon. Stcmmcr has disclosed a 
method for in i/.tro recombmation of related DN.A sequences in which the 
parental sequences are cut into fragments, generally using an enz>Tne such as 
DNase 1, and are reassembled (17. IS. 19). The non-random DNA 
fragmentation associated ^^1th D.Nase 1 and other cndonuclcascs. however, 
introduces bias into tine recombL-ation and Umits the recombinadon diversity-. 

Furthermore, this m.cLnod is Lmitcd to recombination of double-stranded 
polyr.uclcotidcs and car.no: be used on single-stranded templates. Further, 
this method docs no: '^'orr: '^-eli v.nth ccnain combinadons of genes and 
primers. It is not efficient for recombination of short sequences (less thar. 200 
nucleodces (nts)), for c.vam.ple. Finally, it is quite laborious, requiring several 
steps. Altemadvc, convenient methods for creating novel genes by point 
m.utagenesis and recom.binauon in uitro are needed. 

SUMMARY OF THE INVENTION 
The present invention provides a new and significantly improved 
approach to creatLng novel pol>mucleodde sequences by point mutadon and 
rccombunalion in lAlro of a sc: of parental sequences (the templates). The 
■ novel pol>mucieodde sequences car. be useful in themselves (for example, for 
DNA-based compudng). or they can be expressed in recombinant organisms 
for directed evoludon of the gene products. One embodiment of the invcndon 
involves primLng the template gene(s) ^^-ith random-sequence oligonucleoudcs 
to generate a pool of shoa DNA fragments. Under appropriate reacdon 
condidons. th.ese short DNA fragments car. prime one another based on 



wo 98/42832 



PCTAJS98/05956 



complementarity and thus can be reassembled to form full-length genes by 
repeated thermocycling in the presence of thermostable DNA polymerase. 
These reassembled genes, which contain point mutations as well as novel 
combinations of sequences from different parental genes, can be further 
amplified by conventional PGR and cloned into a proper vector for expression 
of the encoded proteins. Screening or selecuon of the gene products leads to 
new variants with improved or even novel functions. These variants can be 
used as they are, or they can serve as new starting points for further c>'cles of 
mutagenesis and recombination. 

A second embodiment of the invention involves priming the template 
gene{s) with a set of primer oligonucleotides of defmcd sequence' or defined 
sequence e.xhibiting limited randomness to generate a pool of short DNA 
fragments, which are then reassembled as described above into full length 
genes. 

A third embodiment of the invention involves a novel process we term 
the 'staggered extension' process, or SlEP. Instead of reassembling the pool of 
fragments created by the extended primers, fuU-lcngth genes are assembled 
directly in the presence of th.e icmplatcis). The StEP consists of repeated 
cycles of dcnaturation followed by cctrcmcly abbreviated annealing/extension 
steps. In each cycle the extended fragrr.ents can anncaJ to different icmplatcs 
based on complcmentariry and extend a lit'Je f-^rther to create "recombinant 
rnssrrr?-.<s " Due ro this :rr^p]p.\f r.-.virrh?"^. rr-.nsr 0- the pol>T.uc!ectides 
contain sequences from ciffereni parental genes (i.e. are novel recombinants). 
This process is repeated unul fuli-length genes form. It can be followed by an 
optional gene ampliiicaiion step. 

The different embodiments of the invention provide features and 
advantages for different applicaiions. In the most preferred embodiment, one 
or more defmed primers or defined primers e.xhibidng limited randomness 
which correspond to or fiank the 5" and 3' ends of the template 
pol>mucleotides are used with StE? to generate gene fragments which grow 
mto the novel full-length seq'^ences. This simple method requires no 
knowledge of the template sequencefs). 

In another orefcrred embodL-r.er.t. multiple defined primers or defined 
printers cxhibitmg limited randomness arc used to generate short gene 
iragm.ents which are reassembled into full-length genes. Using muldplc 
defmed primers allows the user to bias in uilro recombination frequency. If 
sequence informadon is available, primers can be designed to generate 
overlapping recombixtation cassettes which increase the frequency of 
recombmadon at particular locations, v^unong other features, this method 
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introduces the nexibUity to take advanUge of available stnicturai and 
functional information as well as information accumulated through previous 
generations of mutagenesis and selection (or screening). 

In addition to recombination, the different embodiments of the primer- 
based recombination process will generate point mutations. It is desirable to 
know and be able to control this point mutation rate, which can be done by 
manipulating the conditions of DNA synthesis and gene reassembly. Using 
the defmcd-primer approach, specific point mutations can also be directed to 
specific positions in the sequence through the use of mutagenic primers. 

The various primer-based recombination methods Ln accordance with 
this invention have been shown to enhance the activity of Actinoplanes 
utahensis ECB deacylase over a broad range of pH values and in the presence 
of organic solvent and to improve the thermostability of Bacillus subtilis 
subtilisin E. DNA sequencing confirms the role of point mutation and 
recombination in the generation of novel sequences. These protocols havc_ 
been found to be'both simple and reiiablc. 

The above discussed and many other features and attendant 
advantages v/ill become better understood by reference to the follou-ing 
detailed description when tai:cn in conjuncuon with the accompanying 
drawings, 

RRIF.F DESCRIFTION OF THE DRAVmOS 
FIG, 1 depicts recon^.binauon in accordance with the present invention 
using random-sequence printers and gene reassembly. The steps sho'AT^ arc: 
a) S>'nLhesis of single-stranded DNA fragments using mesophiiic or 
thermophilic pol>-merase v/i^n rando.-n-sequence oUgonucleotides as primers 
(primers not shown); b) Removal of templates; c) Reassembly with 
thermophiUc DNA pol>'merase; d) Amplification with thermostable 
polymerase(s); e) Cloning and Screening (optional); and f) Repeat the process 
with selected gene(s) (optional). 

FIG. 2 depicts recombination in accordance w-ith the present invention 
using defmed primers. The meir.od is illustrated for the recombination of tv.'0 
genes, where x = mutation. The steps diagrammed axe: a) The genes axe 
primed with defmed primers in PGR reactions that can be done separately (2 
primers per reaction) or combined (multiple primers per reaction); c) Initial 
products are formed until denned primers are exhausted. Template is 
removed footional) ; d) Initial fragments prime and extend themselves in 
further cycles of PGR with no addition of external primers. Assembly 
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continues until fuU-length genes are formed; e) (optional) Full-length genes arc 
amplified in a PGR reaction wth external primers; f) (optional) Repeat the 
process with selected gene(s). 

FIG. 3 depicts recombination in accordance with the present invention 
using two defmed flanking primers and StEP. Only one primer and two single 
strands from two templates are shown here to iUustrate the recombination 
process. The outlined steps are: a) After dcnaturation. template genes are 
primed with one defined primer; b) Short fragments are produced by primer 
extension for a short time; c) In the next cycle of StEP. fragments are 
randomly primed to the templates and e.xtcnded further; d) Dcnaturation and 
annealing/extension is repeated until full-length genes are made (visible on an 
agarose gel); e) Full-length genes are purified, or amplified in a PGR reaction 
with external primers (optional); 0 (optional) Repeat the process with selected 
gene(s). 

FIG. 4 is a diagrammatic representation of the results of the 
recombination of txvo genes using c^vo fianking primers and staggered 
extension in accordance with Lhe present invention. DNA sequences of five 
genes chosen from the recombined librae' are indicated, where x is a mutation 
resent in the parental genes, ar.d Lhe L-iangle represents a new point 



p 

mutation. 



FIG. 5 is a diagrr~.~.a-J: represenia'Jon o.' ir.e sequences of the pN3 
esterase genes described in Ey.^-^-.ple 3, Template genes 2-13 and 5-B12 were 
recorr.bLned using the defined pri^-r^er approach. The positions of the primers 
are indicated by arrov.-s, a;.d tr.e positions u'herc the parental sequences differ 
from one another are indicated by xs. New point mutations arc indicated by 
triangles. Mutations identified in these recombined genes are listed (only 
positions which differ in the parental sequences are listed). Both 6Ed and 
6HI arc recombination products of the template genes. 

FIG. 6 shows the positions ar.d sequences of the four defined intcmai 
• primers used to generate recombir.cd c=nes from template genes Rl ar^.d R2 
by interspersed pr.m=r-based rcco~.bination. Primer P50F contains a 
mutation (A-^T at base position 59S) which simultaneously elLmLnatcs a 
Hindin restriction site ax^-d adds a new unique Nhel site. Gene R2 also 
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contains a mutation A-^G at the same base position, which eliminates the 
Hindlll site. 

FIG, 7 is an electrophoresis gel which shows the results of the 
restriction-digestion analysis of plasmids from the 40 clones, 

FIG. 8 shows the resales of sequencing ten genes from the dcfmed 
primer-based recombination library. Lines represent 986-bp of subtilisin E 
gene including 45 nt of its prosequence, the entire mature sequence and 113 
nt after the stop codon. Crosses indicate positions of mutations from parent 
gene Rl and R2, while triangles indicate .positions of new point mutations 
introduced during the recombination procedure. Circles represent the 
mutation introduced by the mutagenic primer P50F. 

FIG. 9 depicts the results of appl>'ing the random-sequence primer 
rccombinadon method to the gene for Aciinopianes utahensis ECB deac%iase. 
(a) The 2.4 kb ECB dcacylasc gene ^'^s purified from an agarose gel. (b) The 
size of Lhe random priming products ranged from 100 to 500 bases, (c) 
Fragments shorter than 300 bases v.'cre isolated, (d) The purified fragments 
v/crc used to reassemble the full-lengih gene with a smear background, (e) A 
single PCR product of Lhe sarr.c sLzc as the ECB deac/lase gene was obtained 
after convcntionaJ PCR %^'ith th- r^vo prime.-s located at the start and stop 
regions of Lhis gene, (f) Mter digestion with Xho I and Psh AI, the PCR product 
was cloned into a modified p:J702 vector to form a mutant library-. (g) 
IntroducLng this librar;/ into Streptomyces liiAdans TK23 resulted in 
appro.-dmately 71% clones producing the active ECB dcac>'lasc. 

FIG. 10 shows the specific activir/ of the wild-t>-pe ECB deacylase and 
mutant M16 obtained in accordance with Lhe present invention. 

FIG, 11 shows pH profiles of activit>' of the wild-t>-pe ECB deac>'lase 
and m.utant M16 obtained in accordar.ce with the present invention. 

FIG. 12 shows the DNA sequence analysis of 10 clones randomly chosen 
from the library/ Kienow. Lines represent 955-bp of subtilisin E gene including 
45 nt of its prosequencc, the entire mature sequence and 1 13 nt after the stop 
codon. Crosses indicate posidons of mutations from Rl and R2. while 
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triangles indicate positions of new point mutations introduced during the 
raxidom- priming recombination process. 

FIG. 13 Thermostability index profiles of the screened clones from the 
five libraries produced using different polymerases: a) library/ Klenow, b) 
library/T4, c) libraiy/Sequenase, d) library/Stoffel and e) library/Pfu. 

Normali2ed residual activity (Ar/Ai) after incubation at 65°C was used as an 
index of the enzyme thermostability. Data were sorted and plotted in 
descending order. 

DFrr AILED DESCRIPTION OF THE IIWENTION 
In one preferred embodLment of the present invention, a set of primers 

with ail possible nucleotide sequence combinations (dp(N)L where L - primer 
length) is used for Lhe primer-based recombination. It has been known for 
years that o!igodeox>Tiucleotidcs of different lengths can serve as primers for 
initiation of DNA s>Tithesis on single-su-anded templates by the Klenow 
fragment of E.coli polymerase I (21). Although they arc smaller than the size 
of a normal PGR primer (i.e. less tr.an 13 bases), oligomers as short as 
hexanucleotides can adequately pnmc the reaction and are frequently uzcd in 
labeling reactions (22). The use of rar.com primers to create a pool of gene 
fragments foibv/ed by t^errj i -vjci^sc-.TLbiy m accordance witn the mvcntion iS 
shov.-n in FIG. 1. The steps include generation of diverse ^breeding blocks" 
from the sLngle-stranded poiynucieotide templates through rajidom. priming, 
reassembly of the full-length DN'A from, the generated short, nascent DNA 
fra^.cnts by ir.ermocyclLng in the presence of DNA pol>m:icrase and 
nucleotides, and am^plification of th.e desired genes from the reassembled 
products by conventional PGR for further cloning and screening. This 
procedure inL-ocuces new mutations m.a.iniy at the primmg step but also 
durmg ether steps. These new mz-tations ar.d the m.utations already present 
in the template sequences arc reccmbined curmg reasscm.bly to create a 
library-' of novel DNA sequences. The process can be repeated on the selected 

seaucnces. if desired. 

To can^' out the random primir.c procedure, the tem.platc(s) can be 
single- or denatured double-stranded poi:mucleotide(s) in linear or closed 
circular form.. The templates car. be mb:ed in equimolar amounts, or in 
amounts v/eighted, for example, by their functional attributes. Since, at least 
in some cases, the tem.plate genes are cloned in vectors into which no 
additional m.utauons should be Lntrocuced. they are usually first cleaved with 
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restriction endonucleas={s) and purified /om the vectors. The resulting linear 
DNA molecules are denatured by boiling, annealed to random-sequence 
oligodeoxynucleoUdes and incubated with DNA polymerase in the presence of 
an appropriate amount of dNTPs. Hexanucleotide primers are preferred, 
although longer random primers (up to 24 bases) may also be used, depending 
on the DNA polymerase and conditioning used during random priming 
synthesis. Thus the oligonucleotides prime the DNA of interest at various 
positions along the entire target region and are e.vtended to generate short 
DNA fragments complementary to each strand of the template DNA. Due to 
events such as base mis-inco.-porations and mispriming, these short DNA 
fragments also contain point mutations. Under routinely estabUshed reaction 
conditions, the short DNA fragments can prime one another ba^ed on 
homolog>' and be reassembled into full-length genes by repeated 
thermocycling in the presence of thermostable DNA polymerase. The resulting 
fuU-lenr-h genes will have diverse sequences, most of which, however, still 
resemble that of the originai te.-r.platc D.NA. These sequences can be further 
ampUHed by a conventional ?CR ar.d cloned into a vector for expression. 
Screening or selection of the e.>cpress=d mutants should lead to variants with 
improved or even new speciHc functions. These variants can be immediately 
used as partial solutions to a practical p.-ob!cm, or they can serve as new 
starting points for furthc cycles of directed evolution. 

Compared to 00".:r techniques used for protein optimizadon, such as 
combinatorial cassette and oiigo.-.uclcotidc-cirected mutagenesis (24.25,26), 
error-prone ?CR (27. 23). or DN.-. shufnmg (17,16,19), some of -J".e advantages 
of the random-primer based procedure -for in -uilro proteLn -voluuon are 
summarized as follows; 

1. The tem?latc(s) used for rar.dom priming s>-nthesis n^ay be either 
single- or double- sL-andcd po!%muc!eoddes. In contrast, error-prone PGR and 
Uhe DNA shuffUng method for recombination (17,18,19) necessarily employ 
only double-sL-anded polynucleotides. Usmg the technique described here, 
mutations and/or crossovers car. be -introduced at the DNA level by using 
different DNA-dc?endent DNA pol>-.crases, or even direcUy from mRNA by 
using different RN.n-dependent DN.--. polymerases. Recombination can be 
performed using singie-strar.dcd D.N.-\ ••.:mp!ates. 

2. In contrast to the DNA shuffling procedure, which requires 
fragmentation of the double-stranded DNA template (generally done with 
DNAse I) to generate ra.ndom fragm.ents, the technique described here employs 
random priming svmthesis to obtain DNA fragments of controllable si^e as 
■breeding blocl-.s" for iurxht: reassembly (FIG. 1). One immediate advantage is 
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that two sources of nuclease activity (DNasc I and 5'-3' exonuclease) are 
eliminated, and this allows easier control over the size of the final reassembly 
and amplification gene fragments. 

3. Since the random primers are a population of sjmthetic oligo- 
nucleotides that contain all four bases in every position, they arc uniform in 
their length and lack a sequence bias. The sequence heterogeneity allows 
them to form hybrids with the template DNA strands at many positions, so 
that every nucleotide of the template (except, perhaps, those at the extreme 5' 
terminus) should be copied at a similar frequency into products. In this way, 
both mutations and crossover may happen more randomly than, for example, 
with error-prone PGR or DNA shuffling. 

4. The random-primed DNA synthesis is based on the hybridization of a 
mixture of hexanucleotides to the DNA templates, and the complementary 
strands are synthesized from the 3'-OH termini at the random hexanucleotide 
primer using polymerase and the four deox>'nucleotidc triphosphates. Thus 
the reaction is independent of Lhe length of the DNA template. DNA fragments 
of 200 bases length can be primed equally well as linearized plasmid or ). DNA 
(29). This is particularly useful for engineering peptides, for example. 

5. Since DNase 1 is an cndonuclease that hydrol>-2es double-stranded 
DNA preferentially at sites adjacent to p^-rimidinc nucleotides, its use in DNA 
shufOing may rcsuU in bias (par-Jcularly for genes with high G+C or high A+T 
content) at the step of tc.Tiplatc gene digestion. Effects of this potential bias 
on the overall mutation rate and recombination frequency may be avoided by 
using the random- pricing approach. Bias in random priming due to 
preicrcntiaJ hybridization to GC-rich regions of the template DNA could be 
overcome by increasing the A and T content in the random oligonucleotide 
libraj^'. 

An im.portant part of practicing the present invention is controULng the 
averacre size of the nascent, smgle-strand DNA s>Tithcsized during the random 
priming process. This step has been studied in detail by others. Hodgson and 
Fisk (30) found that the average size of Lhc synthesized single-strand DNA is 
an inverse function of prim.cr concentration: length - k/ n^ItlPc . v;hcrc Pc is 
the primer conccntrauon. The inverse relationship between primer concen- 
tration and output DNA fragment size may be due to steric hindrance. Based 
on this guideline, proper conditions for random-priming s>mthesis can be 
readilv set for individual genes of diifercnt lengths. 
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Since dozens of pol>-mcrases axe currently available, synthesis of the 
short, nascent DNA fragments can be achieved in a variety of fashions. For 
example, bacteriophage T4 DNA polymerase (23) or T7 sequenase version 2.0 
DNA polymerase (31,32) can be used for the random priming synthesis. 

For single-stranded polynucleotide templates (particularly for RNA 
templates), a reverse transcriptase is preferred for random-priming synthesis. 
Since this enz>Tne lacks 3*->5' exonuclease activity, it is rather prone to error. 
In the presence of high concentrations of dNTPs and Mn2\ about 1 base in 

ever>' 500 is misincorporated (29). 

By modifying the reac^Jon conditions, the FCR can be adjusted for the 
random priming synthesis using thermbstable polymerase for the short, 
nascent DNA fragments. An important consideration is to identify by routine 
experimentation the reaction conditions which ensure that the short random 
primers can anneal to the templates and give sufficient DNA amplification at 
higher temperatures. V/e have found that random primers as short as dp(N)i2 
can be used with PGR to generate th= extended primers. Adapting the PGR to 
the random priming s>-nthcsis provides a convenient method to ma:<e short, 
nascent DNA fragments and ma:-:':s this random priming recombination 

tcch.niquc very robust. 

In many evolution sccnar^ios. recombination should be conducted 
bcr.veen oligonucleoddc ssqucr.ccs for v/hich sequence information is available 
for at least some of the template sequences, in such scenarios, it is often 
possible to define and s—ir.-:.-c a scries of primers which arc interspersed 
ber.veen the various mutaticns. When dcfmed primers arc used, tr.ey can be 
ber^.*een 6 and 100 bases long. In accordance wit!-, ihe present invention, it 
was discovered Lhat by alio-^^-ing these defined prLmers to initiate a series of 
overlappmg prLmer erctcnsion reactions (which m.ay be facUitated by 
thermocyclmg), it is possible to generate recombmation cassettes each 
containing one or m.orc of th.e accumulated mutauons, allelic or isotypic 
differences between templates. Using the defined prim.ers in such a way that 
overiapping extension products are generated in Lhe DNA pol>TnerizaUon 
reactions, c-daausuon of avzuiab:. cnmcr leads to the progressive cross- 
hybricizaiion of pnmer extend- products until complete gene products arc 
generated. The repeated rounds of annealing, extension and dcnamration 
sure recom.bination of each overlappmg cassette udth every other. 

A preferred embodiment of tr.e present invention involves methods in 
/hich a set of defmed oUgonucleotide prim.ers is used to prime DNA s>-nthesis. 
FIG. 2 illustrates an exemplar/ version of the present invention in which 
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defined primers are used. Careful design and positioning of oligonucleotide 
primers facilitates the generation of non-random extended recombination 
primers and is used to determine the major recombination (co-segregation) 
events along the length of homologous templates. 

Another embodiment of the present invention is an alternative 
approach to primer-based gene assembly and recombination in the presence 
of template. Thus, as illustrated in FIG. 3, the present invention includes 
recombination in which enzyme- catalyzed DNA polymerization is allowed to 
proceed only briefly (by limiting Lhe time and lowering the temperature of the 
extension step) prior to denaruration. Dcnacuration is followed by random 
annealing of the extended fragmenrs to template sequences and continued 
partial e.vtension. This process is r^rpeated multiple times, depending on the 
concentration of primer and template, until fu!) length sequences arc made. 
This process is called staggered extension, or StEP. .^though random primers 
can also be used for StEP, gene svTiihcsis is not nearly as efficient as ^ith 
defined primers. Thus defined primers are preferred. 

In this method, a brief annealing/extension step(s) is used to generate 
Lhe partially extended primer. A r/p:cai annealing/extension step is done 

under conditions which allow high Hdclir/ primer anncalLng (Tanncailng greater 

th.an Tm'~^), but limit the pol>*TTtcr'j:ation/cxtcnsion to no more than a few 
seconds (or an average c.->r.cns;on lo less than 300 nts). Minimum extensions 
arc preferably on the order of 20-50 nts. It has been demonstrated that 
th.ermostable DNA pol>'TT^. erases r,-p:cally exhibit ma:dmai pcl>Tneri2ation rates 
cf 100-150 nucicotidcs/second/enr^Tr.c molecule at optimal temperatures, 
but follow appro:cma:c .-^jrhenaus i-uncLics at temperatures approaching Lhe 

optimum temoerarurc (Top:). Thus, at a temperature of 55'C, a thermostable 
Doi>'mcrase exhibits only 20-25% of the steady state polymerization rate that it 
exhibits at 72'C [Topi), or 2--; nts/sccond (40). At 37'C and 22''C. Taq 
pol>Tncrase is reported to have extension activities of 1.5 amd 0.25 
nts/second, respectively (24). 3oth lime and temperature can be routinely 
altered based on the des'Lred recon^.binadon events and i-mowledge of basic 
Dolym erase i-tinetics and biochem:s*j-;.'. 

The progress of the staggered extension process is monitored by 
removing aliquots from, tine reaction rube at various time points in the primer 
extension and separating DNA fragments by agarose gel electrophoresis. 
E'.idence of effective primer extensio.n is seen from the appearance of a lo^v 
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molecular weight 'smear' eaxly in the process which increases in molecular 
weight with increasing cycle number. 

Unlike the gene amplification process (which generates new DNA 
exponentially), StEP generates new DNA fragments in an additive manner in 
its early cycles which contain DNA segments corresponding to the dilTcrcnt 
template genes. Under non-amplifying conditions, 20 cycles of StEP generates 
a maximal molar yield of DNA of approximately 40 times the initial template 
concentration. In comparison, the idealized polymerase chain reaction 
process for gene amplification is multiplicative throughout, giving a maximal 
molar yield of approximately 1 x lO^-fold through the same number of steps. 
In practice, the difference betv.'een the two processes can be observed by PGR. 
giving a clear 'band' after only a few (less than 10) cycles when starting with 
template at concentrations of less than 1 ng/ul and priir.ers a: lO-500-fold 

excess (vs. 10^-fold excess r^picaJ of gene amplificatioa). Under simUar 

reaction conditions, the StEP would be expected to give a less visible 'smear', 
which increases in molecular weight uvJi increasing number of c}-cles. When 
significant numbers of primer extended DNA moiccules begin to reach sizes of 
greater than 1/2 the length of the full length gene, a rapid jump in molecular 
weight occurs, as haJf-extcnded fo.'^vard and reverse strands begin to cross- 
hybndijjc to generate fragments neariy 2 times the size of those encountered 
to that point in the process. A: this point, consolidation of the smear into a 
discrete band of the appropriate rr.olccular weight can occur rapidly by either 
continuing to subject the D^'A to StE?, or altering the Licrmcc/cle to allo-.v 
complete extension of the primed DN.A to drive c:cponentiai gene amplificauon. 

Follov/ing gene assembly (and, if necessary, conversion to double 
stranded form) recombincd genes are ampILf":ed (op'uonal), digested with 
suitable restriction enjTjTncs and ligated into expression vectors for screening 
of the expressed gene products. The process caui be repeated if desired, in 
order to accumulate sequence changes leading to the evolution of desired 
funcUons. 

The staggered e.xtension ar.d homologous gene assembly process (StEP) 
represents a pov/erful, fle>dble meL-od for rccombining similar genes in a 
random or biased fashion. The process can be used to conccnLrate 
recombination within or a\va%' from sp-ccinc regions of a known series of 
sequences by controliing placement of primers and the time allov/cd for 
annealing/ e.Ktension steps. It can also be used to recombine specific cassettes 
of homiologous genetic information generated separately or witjiin a single 
reaction. The method is also applicable to recombinmg genes for which no 
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sequence information is available but for which functional 5' and 3' 
amplification primers can be prepared. Unlike other recombination methods, 
the staggered extension process can be run in a single tube using 
conventional procedures without complex separation or purification steps. 

Some of the advantages of the defined-primcr embodiments of the 
present invention are summarized as follows: 

1. The StEP method does not require separation of parent molecules from 
assembled products. 

2. Defmed primers can be used to bias the location of recombination 
events. 

3. StEP allows the recombinaiion frequency to be adjusted by varying 
extension times. 

4. The recombination process ca-n be carried out in a single tube. 

5. The process can be carried cue on single-stranded or double-stranded 
pol>Tiucleotides. 

6. The process avoids the bias introduced by DNasc 1 or other 
cndonucleases. 

7. Universal primers can be used. 

S. Defined primers c:chib'dng ii.T.ited randomness can be used to increase 
the frequency of mutation ai sc!cc:cd areas of the gene. 

As v/ill be apprcc:a:ed by ihosc sluUed in tr.e zsi, several embodiments 
of the present invenuon arc possible. Exemplary embodiments include: 

1. Recombination ar.d point muiauon of related genes using only defined 
fianVdng primers and staggered extension. 

2. Recombination and m.u-^ticn of related genes using flanking primers 
and a series of internal primers a: lev.' enough concentration that exhaustion 
of the primers will occur over tr.c course of the thcrmocy cling, forcing the 
overlapping gene fragments to crosr.-hybridize and extend until recombined 
s>mthctic genes are forrPiCd. 

3. RecombLnation ar.d mu*.a-jon of genes using random-sequence prim.ers 
at high concentration to generate a pool of short DN.-^ fragments which arc 
reassembled to form new genes. 
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4. . Recombination and mutation of genes using a set of defined primers to 
generate a pool of DNA fragments which are reassembled to form new genes. 

5. Recombination and mutation of single-stranded polynucleotides using 
one or more defined primers and staggered extension to form new genes. 

6. Recombination using defined primers with limited randomness at more 
than 30% or more than 60% of the nucleoUde posiUons within the primer. 

E.xamples of practice showing use of the primer-based recombination 
method are as follows. _ ' 

E?CMTPLE 1 

Use of denned flanking primers and staggered extension to 
recombine and enhance the thermosUbUity of subtilisin E 

This example shows how the defined primer recombination method can 

be used to enhance the thcrmostabilir/ of subtilisin E by recombination of t^vo 

genes known to encode sublilisi.-^. E variants thermostabilities exceeding 

that of wild-t>-pe subtilisin E. This e.xample demonstrates the general method 

outlined in FIG. 3 utilizL-.g or.ly r.vo primers corresponding to the 5' and 3' 

ends of the templates. 

As outlined in FiG. 3, c:-:tendcd recombination primers are first 
ecncrated by th.e suggcrcd extension process (StEP). which consists of 
repeated cycles of den^-.uration followed by e:<trcmely abbreviated 
a--.ncalinWe.xtensio.". s:e?(5), The e.T.ended fragments arc reassembled into 
full-lear^ genes by therrr.oc-.-clLng-assisted homologous gene assembly in Lhe 
presence of a DNA po!>-m.cras=, followed by an optional gene am.pllfication 
step. 

Two thermostable sub-JUsir. E mutants Rl and ?J. v^cre used to test the 
defined primer based rccombL-.ation technique using staggered extension. 
The positions at which these t^vo ger.es differ from one another are sho^vn in 
Table 1. Among the ten nucleotide positions that differ in Rl and R2, only 
those mutauons leading to a.-r.ino acid substirations Asn 181-Asp (NISID) and 
' Asn 213-Ser (N218S) confer tr.=r:T^.ostabi!it>-. The remaining mutations are 
neutral with respect to their effects op. thermostability (33). The half-lives at 
65°C of the single variants NISID a.-.d N218S arc appro>dmately 3-fold and 2- 
fold greater th.an that of wild r,-pe subtilisin E, respectively, and their melting 
temoeratures, Tm. are 3.7°C ar.d 3.2'C higher than that of wild type enzyme. 
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respectively. Random recombination events that yield sequences containing 
both these functional mutations will give rise to enzymes whose half lives at 
65'C axe approximately 8-fold greater than that of wild type subtiiisin E, 
provided no new deleterious mutations are introduced into these genes during 
the recombination process. Furthermore, the overall point mutagenesis rate 
associated with the recombination process can be estimated from the catalytic 
activit}' profile of a small sampling of the recombined variant library. If the 
point mutagenesis rate is zero, 25% of the population should exhibit wild type- 
like activity, 25% of the population should have double mutant 
(N181D+N218S)-like activity and the remaining 50% should have sLnglc 
mutant (N181D or N218S)-like activity. Finite point mutagenesis increases 
the fraction of the librar>' Lhat encodes en2>'mcs v,-ith vvild-c>'pc like (or lower) 
activicy. This fraction can be used to estimate the point mutagenesis rate. 

TABLE 1 

DNA and amino acid substitutions in theTmostablc 
subtiiisin E mutants Rl and R2. 







Base 


Position 




Amino acid 




Base 


Substitution 


ip. codon 


Amino acid 


subsutudon 




780 


A -> G 


2 


109 


Asn-yScr 


Rl 


1107 


A G 


2 


218 


Asa->Ser 




1 Kl 


A -+ T 


3 


229 


s>Tionymous 




I 153 


A 0 


3 


233 


syTionvmou s 




484 


A G 


3 


10 


synonymous 




520 


A -> T 


3 


22 


svnon%Tnous 




593 


A — > G 


3 


48 


s>Tion>Tnous 




731 


G A 


1 


93 


Val->lle 


R2 


745 


T -> C 


3 


97 


s>Tionymous 




780 


A G 


2 


109 


Asn->Scr 




995 


A -> G 


I 


181 


Asn->Asp 




1159 


A -> G 


3 


245 


synonymous 


Mutations 


listed 


are relative to v.-iid 


r/pc subtiiisin 


E with base 


substitution at 


7S0 in common. 
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Materials and Methods 

Procedure for defined primer based recombination using two flanking primers. 

Two defined primers, P5N (5'-CCGAG CGTTG CATAT G TGGA AG-3' 
(SEQ. ID. NO: 1), underlined sequence is Ndel restriction site) and P3B (5*- 
CGACT CTAGA GGATC C GATT C-3* (SEQ. ID. NO: 2), underlined sequence is 
BamHI restriction site), corresponding to 5' and 3' Qanking primers, 
respectively, were used for recombination. Conditions (100 ul final volume): 
0.15 pmol plasmid DNA containing genes Rl and R2 (mixed at 1:1) were used 
as template, 15 pmol of each flanking primer, 1 times Taq buffer, 0.2 mM of 
each dNTP, 1.5 mM MgCb and 0.25 U Taq polymerase. Program: 5 minutes of 
95'*C, 80 cycles of 30 seconds 94''C, 5 seconds 55**C. The product of correct 
size (approximately Ikb) was cut from an 0.8% agarose gel after 
electrophoresis and purified using QIAEX 11 gel extraction kit. This purified 
product was digested with Ndel and BamHI and subcloncd into p3E3 shuttle 
vector. This gene library was amplified in E. coli HBlOl and transferred into 
B. subtilis D342S competent cells for expression and screening, as described 
elsewhere (35). 

DNA sequencing 

Genes v.'crc puri:led using QIApr = p spin plasmid miniprcp ki: to obtain 
sequencing quaiiry DS.\. Sequencing was done on an ABl 373 DNA 
Sequencing System using the Dye Termina:or Cycle SequencLng Vut (Perkin- 
Eimer, Branchburg. NJ). 

Results 

The p.^ogrcss of Lhe staggered extension v/as monitored by remo'/ing 
aliquots (10 ul) from the rcacuon Cube at various time points in the primer 
extension process and sepaLrating DN.A fragments by agarose gel 
electrophoresis. Gel electrophoresis of primer erctcnsion reactions revealed 
that annealing/extension reactions of 5 seconds at 55'C resulted in the 
occurrence of a smear approaching 100 bp (after 20 cycles). 400 bp (after 40 
cycles). 800 bp (after 60 cycles) ar.d finally a strong approximately 1 kb band 
v.nthiin this smear. This band (n^.brr^rc of reassembled products) was gel 
'purified, digested with restricticn crsr-^-t BamHI and Ndel, and ligated ^-ith 
vector generated by 5amHl-;Vcei diges-Jon of the £^ colt / B. subtilis p3E3 
shuttle vector. This gene library' was amplified in E. coli HBlOl and 
Lransferrcd into B. subtilis D3423 competent cells for expression and 
screening (35). 
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The thermostability of enzyme variants was determined in the 96-weU 
plate format described previously (33). About 200 clones were screened, and 
approximately 25% retained subtilisin activity. Among these active clones, the 
frequency of the double mutant-Like phenotype (high thermostability) was 
approximately 23%, the single mutant-Like phenotype was approximately 42%, 
and wild typc-Oke phenotype was approximately 34%. This distribution is 
very close to the values expected when the two thermostable mutations N218S 
and N181D can recomblne with each other completely freely. 

Twenty clones were randomly picked from E. coli HBlOl gene library. 
Their plasmid DNAs were isolated and digested with Ndel and BamHL Nine 
out of 20 (45%) had the inserts of correct; size (approximately 1 kb). Thus, 
appro.'cimately 55% of the above library had no activity due to lack of the 
correct subtilisin E gene. These clones are not members of the subtilisin 
library and should be removed from our calculations. Taking into account 
this factor, we find that 55% of the library (25% active cloncs/45% clones with 
correct size insert) retained subtilisin activit>'. This activity profile indicates a 
point mutagenesis rate of less than 2 mutations per gene (36). Five clones with 
inserts of the correct size were sequenced. The results are summarized in FIG. 
4. All five genes arc recombination products v^-ith minimum crossovers 
varying from I to 4. Only one ne-:^* point mutation was found in these five 



cenes. 



Use of dcHncd flan'rdng primers and stnggcrcd extension 
to rccombine pNB esterase mutants 

The nvo-primcr recombination m.ethod used here for pN3 esterase is 

analogous to that described in Example 1 for subtilisin E. Two template pNB 

esterase mutant genes that differ at 14 bases are used. Both templates (61C7 

and 4G4) are used in the plasmid fonn. Both target genes are present in the 

extension reaction at a concentraaon of 1 ng/ul. Flanldng primers (RMIA and 

?J^12A. Table 2) are added at a final concenL-ation of 2 ng/ul (appro. "dm ately 

200-fold molar excess over templaic). 
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TABLE 2 

Primers used in the recombination of the pN13 esterase genes 
Primer Sequence 

RM 1 A GAG CAC ATC AGA TCT ATT AAC (SEQ. ID. NO: 3) 

RM2A GGA GTG GCT CAC ACT CGG TGG (SEQ. ID. NO: 4) 

Clone 61C7 was isolated based on its activity in organic solven: and 
contains 13 DNA mutations vs. the wild-type sequence. Clone 4G4 was 
isolated for thermostability and contains 17 DNA mutations when compared 
with wild-type. Eight mutations are shared between them, due to common 
ancestry. The gene product from 4G4 is significantly m.ore thermostable than 
the gene product from 61C7. Thus, one measure of recombination between 
the genes is the co-segregadon of the high solvent activity cind high 
thermostability or the loss of boLh properties in the recombincd genes. In 
addiuon. recombination frequency ar.d m.utagenic rate can be ascertained by 
sequencing ramdom clones. 

For the pN3 esterase gene, pruner e.^tension proceeds through 90 
rounds of extension v/iih a ihzrrr.oc-yclc consisung of 30 seconds at 94'C 
foliowed by 15 seconds at 55'C. Aiiquots (10 pi) are removed following cycle 
20. 40, 60. 70, 80 an'd 90. Agarose gel electrophoresis reveals the formation 
o: a low molecular v/cigh: 's.-r.car' by cycle 20. which increases in average size 
and overall intensir/ a: each successive sample point. By cycle 90, a 
pronounced smear is cvidcr.: er-r.cnding from 0.5 kb to 4 kb, and exhibiting 
ma:'dmal signal intensir/ a: a size of approximately 2 kb (the length of the full 
length genes). The jump from half-length to full length genes appears to occur 
berwcen cycles 60 arid 70. 

The intense sm.ear is am.plificd through 6 cycles of polymerase chain 
reaction to more clearly define Lhe full length rccombined gene population. A 
minus-primer control is also amplified with flanking primers to determine the 
background due to residual template in the reaction m.Lx, Band intensity from 
the primer e.xtendcd gene populauan exceeds that of the control by greater 
than 10-fold, Lndicating that arr.plifcd. non-recombined , template comprise 
only a small fraction of the a-mplified gene population. 

The amplified recombmcd gene pool is digested with restriction 
enz^.-mcs Xbal and BamHl and ligated into the pNB106R expression vector 
described by Zock et al. (35). Transfonnation of ligated DNA into E. coli strain 
TGI is done using the v.'ell characterized calcium chloride transformation 
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procedure. Transformed colonies are selected on LB/agar plates coniainmg 

20 |ig/ml tetracycline. 

The mutagenic rate of the process is determined by measuring the 
percent of clones expressing an active esterase (20). In addition, colonies 
picked at random arc sequenced and used to defme the mutagenic frequency 
of the method and the efficiency of recombination. 

EXAJVTPLE 3 

RccombinaUon of pNB esterase genes using interspersed 
internal defined primers and staggered extension 

This e.xample demonstrates that .the interspersed defined primer 

recombination technique can produce novel sequences through point 

mutagenesis and recombinadon of mutations present in the parent 



sequences. 



E7q)crimcntal design and background information 

Tu-o pN3 esteras= genes (2-13 a.nd 5-312) were recombined using the 
dcfmed primer recombinadon techr.iquc. G:ne products from both 2-13 and 
5-B12 arc measurably more ih=r.:^.os:abl= ihan wild-t>Te. Gene 2-13 contains 
9 muiaaons no^ originally prescr.-. in, the v.-i!d-ryp= sequence, while gene 5- 
B12 contains K. The posidons a: which dncsc r.vo genes differ from one 
„ « " " ZS C S 0 P. i F 1 0 . o 

Table 3 shov.^s the seq'^er.ces of the eight primers used in this e.xample. 
Locadon (at M-.z o end of the template gene) of oHgo anneaJing to the template 
genes is indicated in the table, as is primer orientation (F indicates a forv.-ard 
primer, R indicates reverse). These primers arc shown as arrows along gene 
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TABLE 3 

Sequences of primers used in this example 



name 


orientation 


location 


sequence 


RMIA 


F 


-76 


GAGCACATCAGATCTATTAAC (SEQ. ID. NO: 3) 


RM2A 


R 


+454 


GGAGTGGCTCACAGTCGGTGG (SEQ. ID. NO: 4i 


S2 


F 


400 


TTGAACTATCGGCTGGGGCGG (SEQ. ID. NO: 5) 


S5 


F 


1000 


TTACTAGGGAAGCCGCTGGCA (SEQ. ID. NO: 6) 


S7 


F 


1400 


TCAGAGATTAGGATCG.AA^AC (SEQ. ID. NO: 7) 


SS 


R 


1280 


GGATTGTATCGTGTGAGAAAG (SEQ. ID. NO: 8) 


SIO 


R 


880 


AATGCCGGA^GCAGCCCCTTC (SEQ. ID. NO: 9; 


S13 


R 


280 


CACG.^CAGGAAGAi"rriGACT(SEQ. ID. NO: IC' 



Materials and Methods 

Defined-primer based recombination 

1. Prcparadon of genes to be recorrvbincd. Plasmids containing &.t genes 
to be recombined were purified from iransfornicd TGI cells using Lne Qiaprcp 
kl: (Qiagcn, ChatsworLh, CA). Plasmids were quandtatcd by LTV absorption 
and mixed 1:1 for a final concenLra^ion of 50 ng/uL 

2. Staggered e:<:ension PGR ar.d reassembly. 4 ^\ of ihc plasmid mL*aurc 
was used as template in a 100 j^i standard reaction (1.5 mM MgCb. 50 mM 
KCl. 10 mM Trls-HCl pH 9.0. 0.1% Triton X-100. 0.2 mM dNTPs. 0.25 U Taq 
polvmcrase (Prom.cga. Madisor., Vvl)) which aJso contained 12.5 ng of each of 
Lhe 8 prlm.crs. conu-c! rcac'Jon which contained no prim.ers was also 
assembled. Rcac^ons were Lh.crm^ocyclcd li-.rough 100 cycles of Q^'C. 30 
seconds: 55'C. 15 seconds. Chec "g ar. aiiq-JOt of the reaction on an agarose 
gel at Lhis poLnt showed the product to be a large smear {vAih no visible 
product in the no primer control). 

3. Dpnl digestion of the templates. I ^1 L'om the assembly reactions was 
then digested with Dpnl to rem.ove the tem.piatc plasmid. The 10 pJ DpaJ 
digest contained 1 x NEBuffer 4 ar.d 5 U Dpnl (both obtaLned from New- 
England Biolabs, Beverly, ar.d was Lncubated at Zl'C for 45 mLnutes, 
followed by Lncubation at 70=C for 10 minutes to heat kill the cnz>nme. 

4. PGR amplifcation :f tr.e reassembled products. The 10 ul digest was 
then added to 90 -1 of a standard PGR reaction (as described in step 2) 
containing 0.4 uM primers 5b (ACTT.^-.TCTAG.^GGGTATTA) (SEQ. ID. NO: 11) 
and 3b (AGGCTCGCGGGATCCCCGGG) (SEQ. ID. NO: 12) specific for inz ends 
of the gene. After 20 cycles c: standard PGR (94'»C. 30 seconds: -8'G, 30 
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seconds, 72°C. 1 minute) a strong band of the correct size (2 kb) was visible 
when the reaction was checked on an agarose gel, while only a very faint band 
was visible in the lane from the no-primer control. The product band was 
purified and cloned back into the expression plasmid pNBI06R and 
transformed by electroporation into TGI cells. 

Results 

Four 96 well plates of colonies resulting from this transformation were 
assayed for pNB esterase initial acdvity and thermostability. Appro.>dmately 
60% of the clones exhibited biual activity and thermostabilty within 20% of 
the parental gene values. Very few (10%) of ihc clones were inactive (less than 
10% of parent initial activity values). These results suggest a low rate of 
mutagenesis. Four mutants with the highest thermostability values were 
sequenced. Two clones (6Ed and 6H1) were the result of recombLnation 
bet^veen the parental genes (FIG. 5). One of the remaining t^vo clones 
contained a novel point mutation, and o.-ie showed no difference from parent 
5B12. The combination of mutations T99C and C204T in mutant 6E6 is 
evidence for a recombination event bctNvccn these t^vo sites. In addition, 
mutant 6H1 shows the loss of mutation A1072G (but the retention of 
mutations C103ST and T1310C), uhich is e^^•d=nce for tv,-o recombination 
events (one bet^veen sites 1023 and 1072. and another bet^veen 1072 and 
, , » .-..I _r r...„ r^o-^r r-T-rarlnn?; were found in the four genes 

sequenced. 

EXAJ.IPLE 4 

Recombination of tvo thermostable subtilisin E variants 
using internal dcfmed primers and staggered cTrtcnsion 

This example demonsL-ates that the defined primer recombination 

technique can produce novel sequences containing new combinauons of 

mutations present in the parent sequences. It further demonsL-ates the utility 

of the defined primer recombination technique to obtain further Lmprovements 

in enzyme performance (here. them'.ostabilit>'). This example further shows 

that the defined primers can bias tr.c recombination so that recombination 

•appears most often in th.e portion of the sequence defmed by the prime.-s 

(inside the primers). Furth.-jrmore. iKis example shows that specific mutations 

can be introduced into the recombined sequences by using the appropriate 

defined primer sequence(s) contai.nLng the desired mutauon(s). 
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Genes encoding two thermostable subulisin E variants of Example 1 
(Rl and R2) were recombined using the defined primer recombination 
procedure with internal primers. FIG. 6 shows the four defmed interna] 
primers used to generate recombined progeny genes from template genes Rl 
and R2 in this example. Primer P50F contains a mutation {A->T at base 
position 598) which eliminates a Hindlll restriction site and simultaneously 
adds a new unique Nhel site. This primer is used to demonstrate that specific 
mutations can also be introduced into the population of recombined 
sequences by specific design of the defmed primer. Gene R2 also contains a 
mutation A->G at the same base position, which eliminates the Hindlll site. 
Thus restricdon analysis (cutting by Nhel and Hindlll) of random clones 
sampled from the recombined library will indicate the efficiency of 
recombination and of the introduction of a specific mutation via the mutagenic 
primer. Sequence analysis of randomly-picked (unscreened) clones provides 
further information on the recombination and mutagenesis events occurring 
during defined primer-based recombLnation. 

Materials and Methods 

Defined' primer based recombinciion 

A version of the defined printer based recombination illustraced in FIG. 
2 was caj-ried out with the addiLion of StEP. 

1. Prcoaration of genes :o be recombined. About 10 ug of plasmids 
containing Rl and ?v2 gene ^ve^e digested at 37*'C for 1 hour with Nde! and 
SamHI (30 U each) in 50 of Ix buffer B (Boehringcr Mannheim, 
Indianapolis, IN). Inserts of apprordmalely 1 kb were purified from 0.8Vo 
preparative agarose gels usLng QL^EX II gel extraction 1-dt, The DNA inserts 
were dissolved in 10 mM Tris-HCl (pH 7,4). The DNA concentrations were 
estimated, and the inserts were mLxed 1:1 for a concentration of 50 ng/ul. 

2. Staggered extension PGR and reassembly. Conditions (100 ul fmaJ 
volume): about 100 ng inserts v.-rc used as template, 50 ng of each of 4 
internal primers, Ix Taq buffer, 0.2 mM of each dNTP, 1.5 mM MgCb and ,25 
U Taq pol>Tnerase. Program: 7 cycles of 30 seconds at 94°C, 15 seconds at 
55°C, followed by another 10 cycles of 30 seconds at 15 seconds at 
SS'C, 5 seconds at 72X (staggered extension), followed by 53 c>xlcs of 30 
seconds at 15 seconds at 55'C, 1 minute at 72*C (gene assembly). 

3. Dpnl digestion of the templates. 1 pi of this reaction was diluted up to 
9.5 ]^\ with dUiO and 0.5 pi of Dpnl restriction enzyme was added to digest the 
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DNA template for 45 minutes, followed oy incubation at 70"C for 10 minutes 
and then this 10 ul was used as template in a 10-cycle PGR reaction. 
4. PGR amplification of reassembled products. PGR conditions (100 pi 
final volume): 30 pmol of each outside primer P5N and P3B, Ix Taq buffer, 0.2 
. mM of each dNTP and 2.5 U of Taq polymerase. PGR program: 10 cycles of 30 
. seconds at 94*C, 30 seconds at 55*G, 1 minute at 72X. This program gave a 
single band at the correct size. The product was purified and subcloncd into 
... pBE3 shuttle vector. This gene library was amplified in £. coli HBlOl and 
transferred into B. sublilis DB428 competent cells for expression and 
screening, as described elsewhere (35). Thermostability of enzyme variants 
was determined in the 96-weIl plate format.describcd previously (33). 

DNA sequencing 

Ten £, coli HBlOl transformants were chosen for sequencing. Genes 
were purified using QLAprcp spun plasrrad miniprep kit to obtain sequencing 
qualiry DNA. Sequencing was done cn an ABI 373 DNA Sequencing System 
using the Dye Terminator Cycle Scc/j cncir.g l-'it (Pcrkin-Elmer, Branchburg. 
NJ). 

Results 

Ij restriction analysis: 

Forv/ clones ra-".dorr.ly picked irorr. the rccombined library were 
digested v/lth rcs^'icrjon enr-^.T-.es lihel a-nd ^amHl. In a separate experiment 
the same ferry plasties v.-erc digested -^v-.th Hindlll and BamHI. These 
reaction products v.cre ar.aiyzed by gel electrophoresis. As shown in FIG. 7, 
eight out of 40 clones (appro;c"a:e:y 20%) contain the newty introduced jYhei 
restriction site, demonsirating ih.at tr.e mutagenic primer has indeed been 
able to introduce the specified mutation into the population. 

2) DiiA sequence analysis 

The first ten randomJy picked clones were subjected to sequence 
analysis, and th.c results are surr.rT'.anzed in FIG. 8. A minimum of 6 out of 
the 10 genes have undergone rc:c ir.ation. Among these 6 genes, the 
minimal crossover events (reccrr.bir.:::;: ber.veen genes Rl and R2 vary from 
' 1 to 4. /^Jl visible crossovers cccurreii v;ithin the region defmed by the four 
primers. Mutations outside this region are rarely, if ever, recombined, as 
shown by the fact that there is no recombination between the two mutations 
at base positions 484 and 520. These results show that the defmed primers 
can bias recombination so that i: appears most often in the portion of the 
sequence defined by the prim.ers (inside tr.e primers). Mutations very close 
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together also tend to remain together (for example, base substitutions 731 and 
745 and base substitutions 1141 and 1153 always remain as a pair). 
However, the sequence of clone 7 shows that two mutations as close as 33 
bases apart can be recombined (base position at 1107 and 1141). 
5 Twenty-three new point mutations were introduced in the ten genes 

during the process. This error rate of 0.23% corresponds to 2-3 new point 
muUtions per gene, which is a rate that has been determined optimal for 
generating mutant libraries for directed enzyme evolution (15). The mutation 
types are listed in Table 4. Mutations axe mainly transitions and are evenly 
10 distributed along the gene. 

TABLE 4 



20 



New poi 


nt mutations identified 


in ten recombined 


genes 


Transiuon 


Frequency 


Transversion 


Frequency 


G A 


4 


A -> T 


1 


A -> G 


4 


A C 


I 


C -> T 


3 


C A 


1 


T-^ C 


5 


C G 


0 






G -> C 


1 






G -> T 


0 






T -> A 


3 . 






T-> G 


0 



A tocal of 9S60 bases v.'cre sequenced, i he mutadon raie was 0.23% 

4j Phenotypic analysis 

Approximately 450 B, subtilis D3428 clones were picked and grown in 
SG medium supplemented \^'ith 20 ug/ml kanamycin in 96-wcll plates. 
Approximately SoVo of the clones e.>:presscd active enz>'mes. From previous 
e.XDericnce. we knovv- Lhat this level o: inactivation indicates a mutation rate on 
the order of 2-3 mutations per ger.e (35). Approximately 5% clones showed 
double mutant (N151D+N218S)-iikc phcnot>i3es (which is below the expected 
25% value for random reco-binaiion alone due primarily to poLnt 
mutagenesis). (DN.A sequencL-.g showed Lhat two clones, 7 and 8, from the ten 
randomly picked clones contain boih. N21SS an.d NISID mutations.) 



25 
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EXAMPLE 6 

Optimization of the Actinophxtxes utahensis ECB dcacylase by 
the random-priming recombination method 

In this example, the method is used to generate short DNA fragments 
from denatured, linear, double-stranded DNA (e.g., restriction fragments 
puriiled by gel electrophoresis; 22). The purified DNA, mixed with a molar 
e.xcess of primers, is denatured by boiling, and synthesis is then carried out 
using the Klenow fragment of E. coli DNA polymerase I. This enzyme lacks 
5'->3' exonuclease activity, so that the random priming product is synthesized 
exclusively by primer extension and is not degraded by exonuclease. The 
reaction is carried out at pH 6.6, where the 3*->5' exonuclease accivit>- of the 
enz\^s is much reduced (36). These conditions favor random initiation of 
s>Tithesis. 

The procedure involves the following s:cps: 

1, Cleave the DNA of in:crcs: with appropriate rcstricdon 
endonuclease(s) and purL^ the DNA fragment of interest by gel electrophoresis 
using V/Lzard PGR Prep Kit (Promcga. Madison. Vvl), As an example, the 
Actinoplanes utahensis ECS deacyiasc gene was cleaved as a 2.4 kb-long Xho 
I-Psh AI fragment from the recombina::: piasn-.id pSHPlOO. It was essential to 
linea--Lzc the DNA for one subscq-jcn: cenaruration step. The fragment was 
p'^rS:zd by agarose gc! eiccircpho.'csls using tr.e V/izard PGR Prep Kit 
(Promega. Madison, V/1) (FIG. 9, step (a)). Gel purification was also essentia! in 
order to remove "tr.e res'jnction endonucieasc buffer from line DNA, since the 

N!g-* ions mal-:c \\ dilTicul: to denature the DNA in the next step. 

2. 400 ng (about 0.51 p-o\) of th.e double-stranded DNA dissolved 
in HiO was mixed \^nth 2.75 pg (about 1.39 nmol) of dp(N)6 random primers, 
.'^iter immersion in boiling 'A-ater for 3 m.inutes. the mucrure was placed 
im.mediately in an ice/ethanol ba*Jn. 

The si2e of the random priming products is an. inverse function of the 
concentration of prim.er (33). The presence of high concentrations of primer is 
thought to lead to steric hindrance. Under the rcacdon conditions described 
.here the random prim.ing produces are appro:dmately 200-400 bp, as 
determixied by electrophoresis through ?ui al!-:alLnc agarose gel (FIG. 9 step b). 

3. Ten pi of 10 x reaction buffer [lOX buffer: 900 mM HEPES, pH 
6.6; 0.1 M magnesium chloride, 10 mNt dithiothreitol, and 5 mM each d-ATP. 
dCTP, dOTP and dTTP) was added to the denatured sample, and the total 
volume of the reaction mbcrure was brought up to 95 u\ with H:!0. 



wo 98/42832 



PCT/US98/05956 



^'9 

4. Ten units (about 5 pi) of the Klenow fragment of E.coli DNA 
polymerase I was added. All the components were mixed by gently tapping the 
outside of the tube and were centrifuged at 12,000 g for 1-2 seconds in a 
microfuge to move all the liquid to the bottom. The reaction was carried out at 
22'*C for 35 minutes. 

The rate of the extension depends upon the concentrations of the 
template and the four nucleotide precursors. Because the reaction was 
carried out under conditions that minimize exonucleolytic digestion, the newly 
synthesized products were not degraded to a detectable extent. 

5. After 35 minutes at 22''C, the reaction was terminated by cooling 

Lhe sample to O^C on ice. 100 \x\ of ice-cold H2O was added to the reaction 
mLxTure. 

6. The random primed products were purified by passing the whole 
reacuqn mLxture through CenL-icon- 100 (to remove the template and proteins) 
and Centricon-10 filters (to remove Lhs primers and fragments less than 50 
bases), successively. Centricon filters are available from Amicon Inc (Bervcrly, 
M.^j. The retentate fraction (abcu: S5 pi in volume) was recovered from 
Ceniricon-lO. This fraction contained the desired random priming products 
(FIG. 9, step c) and v/as used for whoic gene reassembly. 

Reassembly of th.c wriole gene was accom.pUshed by the following steps: 

1. For reassembly by PGR, 5 yl of the random-primed DN.^ 
fragments fro.m CenLncon-IO, 20 pi of 2m PGR prc-mLx (5-fold diluted cloned 
P/j buffer. 0.5 m.M each dNT?, O.IU/pl cloned Pfn pobmcrase (Stratagene, U 
Jolia. GA)). 8 pi of 30% (v/v) glycerol and 7 pi of H3O were mixed on ice. Since 
the concentration of the random-prLmed DNA fragments used for reassembly 
is the m.ost Important variable, it is useful to set up several separate reactions 
with different concentrations to establish the preferred concentration. 

2. After incubation at 95°G for 6 minutes. 40 Lhcrmocycles were 
penormed, each with 1,5 minutes at 95^C, 1.0 minutes at SS'G and 1.5 
m.Lnutes + 5 second/cycle at 72^0, vs-i:h the extension step of the last cycle 
proceedLng at 72^C for 10 mLnutcs, in a DNA Engine PTC-200 (MJ Research 
Inc., Wateaown, MA) apparatus wiLnout addLng any mineral oil. 

3. 3 pi aliquots at cycles 20. 30 and 40 were removed from the 
reaction mixture and analyzed by agarose gel electrophoresis. The 
reassembled PGR product at 40 cycles contained the correct size product in a 
smear of larger and smaller sizes (see FIG. 9, step d). 



wo 98/42832 



PCT/US98/05956 



0 

The correctly reassembled product of this first PGR was further 
amplified in a second PGR reaction which contained the PGR primers 
complementary to the ends of the template DNA. The amplification procedure 
was as follows: 

1. 2.0 |il of the PGR reassembly aliquots were used as template in 
100-pl standard PGR reactions, which contained 0.2 mM each primers of 
xhoF28 (5* GGTAGAGCGAGTGTCGAGGGGGAGATGG3*) (SEQ. ID. NO: 13) and 
pshR22 (5* AGCCGGGGTGACGTGGGTCAGC 3') (SEQ. ID. NO: 14). 1.5 mM 
MgCb, 10 mM Tris-HGl (pH 9.0], 50 mM KCl, 200 pM each of the four dNTPs, 
6% (v/v) glycerol. 2.5 U of Tag polymerase {Promega, Madison, WI) and 2.5 U 
of PfvL polymerase (Stratagcne, La Jolla, CA), 

2. After incubation 96'G for 5 mLnutes, 15 Lhermocycles were 
performed, each with 1.5 minutes at 95'C. 1.0 minutes at 55»C and 1.5 
minutes at 72''C, followed by additionai 15 lhermocycles of 1.5 minutes at 
95«C. 1.0 minutes at SS^C and 1.5 minutes + 5 second/cycle at 72'G v.-^:th the 
extension step of the last cycle proceeding at 72'G for 10 m.inutes. in a DNA 
Engine PTC-200 (MJ Research Inc.. Water.own. MA) apparatus without adding 
any mineral oil. 

3. The amplificauon resulted in a large amount of PGR product 
%vith the correct size of th.c EC3 dcar/lase whole gene (FIG. 9. step e). 

Cloning v/as accomplished as foUo'^'s.; 

1. The PGR oroduc- of EC3 deacylase gene was digested with Xho I 
and Psh Al resu-iction enr.TT.cs, a-nc cloned in-o a modified pIJ702 vector. 

2. S. /iVi'dans TK23 protoplasts were transform.ed with the above 
U^'ation mlxrure to form a mutant librar>'. 

fn .-^iTu screenine the ECB deacvlas: rr. utants 

Each iransformant wiihin ihe S. (iVJdaas TK23 library obtained as 
described above was screened lor deacylase acti-.-ity wth an in stfu plate assay 
method using ECB as substrate. Trar.sforrr.cd protoplasts were allowed to 
regenerate on R2YE agar plates by incubation at 30'C for 24 hours and to 
develop in the presence of th.ios--rcp;on for furJier 4S-72 hours. When the 
colonies grew to proper sL-.e. 6 ml of 45=C purified-agarose (Sigma) solution 
contaLning 0.5 mg/m! ECB in 0.1 M sodium acetate buffer (pH 5.5) was 
poured on top of each R2YE-agar plate and allowed to further develop for 18- 
24 hours at 37°C. Colonies surrounded by a clearing zone larger than that of a 
control colony containLng v.-ild-t:.-pe recombLnant plasmid pSHP150-2 were 
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indicative of more efficient ECB hydrolysis resulting from improved enzyme 
properties or improved enzyme expression and secretion level, and were 
chosen as potential positive mutants. ■ These colonies were picked for 
subsequent preservation and manipulation. 

HPLC assay of the ECB deacvlase mutants 

Single positive transformants were inoculated into 20 ml fermencation 
medium containing 5 pg/ml Lhiostrepton and allowed to grow at SO^-C for 48 
hours. At this step, ail cultures were subjected to HPLC assay using ECB as 
substrate. 100 pi of whole broth was used for an HPLC reaction at aO'^C for 30 
minutes in the presence of 0.1 M NaAc (pH 5.5). 10% (v/v) McOH and 200 
pg/ml of ECB substrate. 20 pi of each reaction mixture was loaded onto a 
PolyLC polyhydroxyechyl aspartamide column (4.6 x 100 mm) and cluied by 
aceconitrile gradient at a Ho-.v rate of 2.2 ml/min. The ECB-nucleus was 
detected at 225 nm. 

Punfication of the ECB deacvlase r-.ut^nts 

After the HPLC assay. 2.0 rr.l pre-culcures of all potential positive 
mutants were then used to inoculate 50-ml fermentation medium and allowed 
to grow at 30^C, 280 rpm for 95 hours. These 50-ml cultures were then 
ccntrifugcd a: 7,000 g for 10 minutes. The supematants were re-ccntrifugcd 
a: 16,000 g for 20 mmutcs. The supcmatants containmg the ECB dear/lase 
mu:ant cnz\Tr.es were s:ored a: -20'*C. 

The supematants from ir.e posidvc mutants were furth.er concentrated 
to 1/30 th.eir original volum.e %^-ith an ;^J7iicon filtration unit with m.olecular 
weight cutoff of 10 kD. The resuldng enr.'m.e samples were diluted v.-ith an 
equal volume of 50 mM KH2?04 (?H 6.0) buffer and 1.0 ml was applied to Hi- 
Trap ion exchange column. The bindmg buffer was 50 mM KH:PO. (pH 6.0). 
and the elution buffer was 50 mM KH^PO. (pH 6.0) and 1,0 M NaCl. A linear 
^adient from 0 to 1.0 M NaCl v.-as applied in 8 column volum.es with a How 
rate of 2.7 ml/mLn. The ECB cear/lase mutant fraction eluted at 0.3 M NaCi 
and was concentrated and buffer c:--r.anged into 50 mM KH2PO. (pH 6.0) in 
^ Amicon CenLncon-lO units. Er-.-v: punc>' was veriued by SDS-PAGE, and 
the concen'^-ation was determined using the Bio-Rad Protein Assay. 

Specific activity assav of the ECB d e?.cvlase mutants 

4.0 pg of each purified ECB deacylase mutant was used for the activity 
assav at 30'C for 0-60 munutes in the presence of 0.1 M Na.^c (pH 5.5). 10% 
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(v/v) MeOH and 200 fig/ml of ECB substrate. 20 ^ll of each reacuon mixture 
was loaded onto a PolyLC polyhydroxyethyl aspartaxnide column (4.6 x 100 
mm) and eluted with an acetonitiile gradient at a flow rate of 2.2 ml/min. The 
reaction products were monitored at 225 nm and recorded on an IBM PC data 
acquisition system. The ECB nucleus peak was numerically integrated and 
used to calculate the specific activity of each mutant. 

As shown in FIG. 10, after only one round of applying this random- 
priming based technique on the %v-ild-type ECB deacylase gene, one mutant 
(M16) from 2,012 original transformants was found to possess 2.4 times the 
specific acuvity of the wild-type enzyme. FIG 11 shows that the activity of 
M16 has been increased relative to chat of. the wild-type enzyme over a broad 
pH range. 

EXAMPLE 7 

Improving the thcrmostibiUty Bacillus sufatilLs subtilisin E 
using the random-sequence primer recombination method 

This e.xajnple d=monsL-a:cs th.c use of various DNA polymerases for 
primer-based rccombmation. 1: furths.' demonstrates the stabiUzadon of 
subtilisln E by recombination. 

Genes Rl and R2 cncod-.g tr.e t^vo th.ermostable subtilisin E variants 
described in E.-:amplc 1 were chosen as the icri?la:cs for rccombinadon. 

(I) Tarcei gene preparatior. 

Subtilisin Z th.cr.-r.or_.blc .-r.u:anl genes R! and R2 (FIG. 11) were 
subjected to random pp--n=d DV^h s>-nthesis. The 9S6-b? fragment ir.cludmg 
45 ni of subdlisLn S prosequcncc. the entire mararc sequence and 1 13 nt after 
the stop codon v.-cre obtained by double digestion of plasmid pBE3 with Bam 
HI and Nde 1 and punned from a 0.8% agarose g=! using the Wizard PGR Prep 
Kit (Promega, Madison, V/1). 1: was essential to linearize the DNA for the 
subsequent denaturation step. 0=1 puriRcalion was also essential in order to 
remove the restriction endonuclcasc buffer from the DNA. since the Mg^^ ions 
mai:e it difnculc to denature tr.c DNA ir. the ne.Kt step, 

(2) Random primed DNA synthesis 

Random primed DNA s:.v.t:-.=sis used to generate short DNA fragments 
from denatured, linear, double-sLranded DNA. The purified fl. subMs 
subtilisL-. E mutant genes, rr.Lxed with a molar e.xcess of primers, were 
denatured by boUmg, and s^v/'ncsis was tlnen carried out using one of the 
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following DNA polymerases: the Klenow fragment of £. coh DNA polymerase I. 
bacteriophage T4 DNA polymerase and T7 sequenase version 2.0 DNA 
polymerase. 

Under its optimal performance conditions (29), bacteriophage T4 DNA 
polymerase gives similar synthesis results as the Klenow fragment docs. \Vhcn 
T7 sequenase version 2.0 DNA polymerase (31, 32) is used, the lengths of the 
synthesized DNA fragments are usually larger. Some amount of MnClj has to 
be mcluded during the synthesis in order to control the lengths of the 
synthesized fragments within 50-400 bases. 

Short, nascent DNA fragments can aJso be generated with PGR using 
the Stoffel fragment of Tag DNA polymerase or Pfu DNA polymerase. An 
important consideration is to identify by routine e.^erimentation the reaction 
conditions which ensure that the short random primers can anneal to the 
te.mplates and give sufficient DNA amplificadon at higher temperatures. We 
have found that rando.-n primers as short as dp(N)n can be used v/ith PGR to 
generate fragments. 

2.1 Random primed DNA s>Tithc3is wiLh the Klenow fragment 

The Klenow fragmcn: of £. co/i DNA polymerase I lacks 5*-*3' 
exonuclcase activity, so that the random priming product is s>-n-^nesi2cd 
exclusively by orixr^z-: extension aj-.d is not degraded by exonuclcase. The 
reaction was carried oui at pH 6.6, where the 3*->5" exonuclease activity of the 
cnz-,Tnc is much reduced (35). These conditions favor random iniiiaiion of 
synihesis. 

1. 200 ng (abou: 0.7 pmol) of RI DNA and equal amoun: of ?a DNA 
dissolved in H2O was mLxed wit±i 13.25 pg (about 6.7 nmol) of dp(N)5 random 
primers. After immersion Ln boiling water for 5 minutes, Lhe mbcturc was 
placed im.mediately in am ice/cthanol bath. 

The size of the randorr. priming products is an inverse function of the 
concentration of primer (30). The presence of high concenU-ations of primer is 
thought to lead to steric hindrance. Under the reaction conditions described 
here the random priming products are approrcimatcly 50-500 bp, as 
. determined by agarose gel electrophoresis. 
2 Ten pi of 10 x reaction '\y^zr [lOx buffer: 900 mM HEPES, pH 6.6; 0.1 
M ma^esium chloride, 20 mM dilhiothreitol, and 5 mM each d.ATP, dCTP, 
dGTP and dTTP) %vas added to the denaaired sample, and the total volume of 
the reaction mixture was brought up to 95 pi with. H2O. 
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3. Ten units (about 5 nl) of the Klenow fragment of E.coli DNA polymerase 
I (Boehringcr Mai^nheim, IndianapoUs. IN) was added. All the comoonents 
were mixed by gently tapping the outside of the tube and were centrifuged at 
12.000 g for 1-2 seconds in a microfuge to move all the Uquid to the bottom. 
The reaction was carried out at 22°C for 3 hours. 

The rate of the extension depends upon the concentrations of the 
template and the four nucleotide precursors. Because the reaction was carried 
out under conditions that minimize exonucleolytic digestion, the newly 
synthesized products were not degraded to a detectable extent. 

4. After 3 hours at 22°C, the reaction was terminated by cooling the 
sample to O'C on ice. 100 pi of ice-cold H,0 was added to the reaction 
mlNture. 

5. The random primed produces were purified by passing the who!e 
reaction mbcture through M.crocon-100 (Amicon, Beverly MA) (to remove the 
template and proteins) and Microcon-10 fUters (to remove the primers and 
fragments less than 40 bases), successively. The retentate fraction (about 65 
Pl in volume) was recovered from tine .Microcon- 10. This fraction contaimng the 
desired random priming products was buffcr-e.xchangcd against PCR reaction 
buffer with the new Microcon-10 furiner use in whole gene reassembly. 

2.2 Random primed DNA sj-nuhesis wi-Jn bacteriophage T4 DNA pol>-merase 

Bactenophagc \~ Ljn.-\ uo.j i.:crao- c^.^. • 

DNA DoW-merase 1 zsc simila.- in that each possesses a 5--3' poh-mcrasc 
activity and a o-o e.v.onuciease activ:ry. The exonucleases actiricy of 
bacteriophage T4 DN.^ polymerase is more than 200 times that of th.t Kleno^v 
f.-agment. Since it does not displace the short oligonucleotide primers from 
sLngle-stranded DNA templates (23). th.s efnccncy of mutagenesis is different 
from the KJenow fragment. 

1. 200 ng (about 0.7 pmo!) of Rl DNA and equal amount of R2 DNA 
dissolved in H:0 was mixed with 13.25 pg (about 6.7 nm.ol) of dp(N)5 random 
or.mers. After immersion in boUir.g v.-atcr for 5 minutes, the mixture was 
placed immediately m an ice/et:.a.-.ol bath. The presence of high concentra- 
tions of primer is thought to lead to stcric hindrance. 
■ 2. Ten pi of 10 x reaction buffer jlOx buffer: 500 mM Tris-HCl. pH 8.8; 
150 mM (NH4)2S04; 70 mM magr.esium chloride, 100 mM 2-mcrcapto=thanol, 
0.2 mg/ml bovine serum albur^^n ar.d 2 mM each dATP. dCTP. dGTP and 
dTTP) was added to t:-^.= cenarured sample, and the total volume of the 
reaction mLxrure was brought u? to 90 pi with H2O. 
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3. Ten units (about 10 pi) of the T4 DNA polymerase I (Bochrmger 
Mannheim, Indianapolis, IN) was added. All the components were mixed by 
gently tapping the outside of the tube and were centrifuged at 12,000 g for 1-2 
seconds in a microfuge to move all the liquid to the bottom.^The reaction was 
carried out at 37°C for 30 minutes. Under the reaction conditions described 
here the random priming products are approximately 50-500 bp. 

4. After 30 minutes at 37°C, the reaction was terminated by cooling the 
sample to 0-C on ice. 100 pi of ice-cold H2O was added to the reaction 
mbcture. 

5. The random primed products were purified by passing the whole 
reaction mixture through Microcon-IOO (to remove the template and proteins) 
and Microcon-10 filters (to remove xhz primers and fragments less than 40 
bases), successively. The retcntate fraction (about 65 ul in volume) was 
recovcred'from the Microcon-10. This fraction containing the desired random 
priming products was buffer-exchanged against PGR reaction buffer ^.■i:h the 
new Microcon-IO furLher use in whole gene reassembly. 

2.3 Random primed DNA s>-nthesl3 unth the T7 scqucnase v2.0 DNA 
po!>Tncrasc 

Siace the T7 scqucnase v2.0 DNA polymerase lacks exonucleasc 
acuviry and is highly processivc. the average lenr^ of DNA s^-n'Jies^^ed is 
grca'.er than that of DNAs s-.-^-es'lzed by the Klenow iragmenl or DNA 
pol)Tnerase. But in the presence of proper amount of MnClj in the rcac-Jon. 
the size of the s>-nthesi2cd irag~cnts can be conlroUcd to less than 400 bps. 

1. 200 ng {about 0.7 pn^o!) of Rl DNA and equal amount of R2 DNA 
dissolved L-i H^O was mixed -.vith 13.25 pg (about 5.7 nmoi) of d?{N)5 randomi 
primers. After Lmmcrsion in boUing water for 5 minutes, the mbcnjre was 
placed immediately in an ice/ethanol bath. The presence of high concen- 
trations of pdmcr is thought to lead to steric hindrance. 

2. Ten \A of 10 x reaction buffer (!0X buffer: 400 mM Tris-HCl, pH 7.5; 
200 mM magnesium chloride, 500 m.M NaCl, 3 mM MnCb, and 3 mM each 
d,ATP, dCTP, dGTP and dTTP) was added to the denatured sample, and the 
total volume of t±ie reaction t?:-^.:.'z was brought up to 99.2 pi with H2O. 

'3. Ten units (about 0.8 pi) of tr.e T7 Sequenase v2.0 (Amersham Life 
Science, Cleveland, Ohio) was added. .All the components were mixed by 
gently tapping the outside of tlie tube and were centrifuged at 12,000 g for 1-2 
seconds in a microfuge to move all the liquid to the bottom. The reaction was 
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carried out at 22*C for 15 minutes. Under the reacuon conditions described 
here the random priming products are approximately 50-400 bps. 
4. After 15 minutes at 22'*C, the reaction was terminated by cooling the 
sample to 0°C on ice. 100 pi of ice-cold H2O was added to the reaction 
5 ■*• miTture. 

* 5. The random primed products were purified by passing the whole 
reacuon mbcture through Microcon-100 (to remove the template and proteins) 
and Microcon-10 filters (to remove the primers and fragments less than 40 
bases), successively. The retcntate fraction (about 65 pi in volume) was 
10 recovered from the Microcon-lO. This fraction containing the desired random 

priming products was buffer-exchanged against PGR reaction buffer with the 

■ new Microcon-10 further use in whole gene reassembly. 

2.4 Random primed DNA synt±icsis wit±i PGR using the Stoffcl fragment of 
15 Tag DNA pol>Tnerase 

Similar to the Klenow fragmcn: of E. coli DNA polymerase I, the Stoffel 
fragment of Tag DNA pol>Tncrase lacks 5' to 3" exonuclease activity, it is also 
more thermostable than Tag DNA pc!>-n:erasc. The Stoffel fragment has low 
processivip^, c:ctcndLng a primer an average of only 5-10 nucleotides before it 
dissociates. As a result of its lower proccssivicy, it may also have improved 
fidclir)'. 

1 50 ng (about 0,175 prr.o!) of Rl DNA and equal amount of R2 DNA 

cissoived in H:0 was mLxcd v/iL". 5.13 (about 1.7 nmol) of dp(N)i3 random 
primers. 

2. Ten pi of 10>: reaction prc-mtx [lOx reaction pre-mLx: 100 mM Tris-HCl, 
oH 8.3; 30 mNl .magnesium chloride, 100 mM KCl, aind 2 m.M each dATP, 
dCTP, dGTP and dTTF) was added, and the total volume of the reaction 
mixture was brought up to 99.0 pi v-ith H2O. 

3. .After incubation at 96°C for 5 mmutes, 2.5 units (about 1.0 pi) of the 
jc Stoffel fragment of Tag DNA polvTTicrasc (Perkin-Elmer Corp., Norv.'al]-:, CT) 

was added. Thirty-five thermoq/cles v.-cre performed, each with 60 seconds at 
gS'^C. 60 seconds at SS^'C and 50 scc':.-ds at 72°C, \%-ithout the extension step 
of the last cycle, in a DNA Engine P7C-2C0 (MJ Research Inc., Watcrtown, M.A) 
aooararjs. Under the reaction condiuop.s described here the random priming 
35 products are approximately 50-500 bp. 

4. The reaction was terminated by coolLng the sample to O'C on ice. 100 pi 
of ice-cold H7O was added to Uhe reaction mixture. 



20 
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5. The random primed products were purified by passing the whole 
reaction mixture through Microcon-lOO (to remove the template and proteins) 
and Microcon-10 filters (to remove the primers and fragments less than 40 
bases), successively. The retentate fraction (about 65 pi in volume) was 
5 recovered from the Microcon-10, This fraction containing the desired random 

priming products was buffer- exchanged against PGR reaction buffer widi the 
new Microcon-10 further use in whole gene reassembly. 

2.5 Random primed DNA s>'nthesis with PGR using Pfu DNA polymerase 
10 Pfu DNA polymerase is extremely thermostable, and the enzyme 

possesses an inherent 3* to 5* exonuclease activity but does not possess a 
5*_>3' exonuclease activity, ks base substicution fidelity has been estimated to 

be 2 X 10-6, 

1. 50 ng (about 0.175 pmol) of Rl DNA and equal amount of R2 DN.A 
15 dissolved in H^O was mixed wiin 6.13iT:g (about 1.7 nmol) of dp(N)i2 random 

primers. 

2. Fifty \x\ of 2 x reaction prc-rr.LX (2 x reaction prc-mLx: 5-fold diluted 
cloned Pfu buffer (Stratagene, La JoUa, CA). 0.4 mM each dNTPJ, was added, 
and the total volume of Lhc rcacJon rr.Lxa:rc was brought up to 99.0 jjI wi^h 

20 H2O. 

3. Afccr incubation a: 96'C for 5 minutes, 2.5 units (about I.O of Pfu 
DNA polvTCcrasc (Stratagene. La Jo!!a, CA) was added. Thirry-five chcrmo- 
cvcics were performed, each wiu" 60 seconds at 95''C, 60 seconds at 55'C and 
50 seconds at 72°C, without the extension step of the last c>'cle, in a DNA 

2 5 Engine PTC-200 (MJ Research Inc., V/atertov.-n, MA) apparatus. Under the 

reacdon conditions described here the major random priming products are 
approximately 50-500 bp. 

4. The reaction was terminated by cooling the sample to 0°C on ice. 100 
pi of ice-cold H2O was added to the reaction mucture. 

30 5. The random primed prod'jcts v/ere purified by passing the whole 

reaction mLxture through Microcon-lCO (to remove Lhc template and proteins) 
*and Microcon-10 filters (to ren^.ovc tr.c primers SLnd fragments less thain 40 
bases), successively. The retentate fraction (about 65 \x\ in volume) was 
recovered from the Microcon-10. This fraction containing the desired random 

3 5 priming products was buffer-exchanged against PGR reacdon buffer \rith the 

new Microcon-10 further use in whole gene reassembly. 
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(3) Reassembly of the whole gene 

1. For reassembly by PGR. 10 jil of the random-priined DNA fragments 
from Microcor;-10, 20 m1 of 2 X PGR pre-mix (5-fold diluted clor^ed Pfu buffer, 
0.5 mM each dNTP. O.lU/nl cloned Pfu polymerase (Stratagene, La JoUa. CA)). 
15 pi of HaO were mixed on ice. 

2 After incubation at 96-C for 3 minutes. 40 thermocycles were 
performed, each with 1.0 minute at 95'G, 1.0 minute at 55-G and 1.0 minute 
: . 5 second/cycle at 72'C. with the e.xtension step of the last cycle proceeding 
at 72»C for 10 minutes, in a DNA Engine PTC-200 (MJ Research Inc., 
Watertown, MA) apparatus without adding any mineral oil. 
3 3 Ml aliquots at cycles 20, 30 and 40 were removed from the reaction 
mixture and analyzed by agarose e=1 electrophoresis. The reassembled PGR 
product at 40 cycles contained the correct si.e p.'oduct in a smear of larger 
and smaller sizes. 

(4) AmDlification 

The corrccUy reassembled product of this Hrst PGR was further 
amplified in a second PGR reaction which contained the PGR primers 
complementary to th.e ends of th.c tcmpla'-c DNA. 

1 2.0 Ml of the PGR reassembly aiiquocs were used as template in lOO-pl 
standard PGR reactions, which contained 0.3 mM each primers of PI (5' 
CCG.^GCGTTGC ATATGTGG.V.G S) i:.^,^. ■ ~ - 

CG.ACTGTAG.AGG.MCGG.ATTC 3) (SEQ. ID. NO: 16), 1.5 mM MgGb, 10 mM 
Tns-HGl [pH 9.0]. 50 mM KCl, 200 mM each of the four cNiPs, 2.5 U of Tag 

/o ^-ap Madison V/i US.^) and 2.5 U of Pfu pol>-merase 
polv-merase (Promega, iMaoison, wo , 

{Stratagene, La JoUa, G.A). 

2. After Lncubation at 96'G for 3 mmutes, 15 thermocycles were 
oerformed, each with 60 seconds at 95°G, 60 seconds at 55»C and 50 seconds 
at 7TC followed by additional 15 tr.ermoo-cles of 60 seconds at 95'G. 60 
seconds at 55'G and 50 seconds 5 second/cycle) at 72'G .-ith the extension 
step of the last cycle proceedmg at 72 = C for 10 minutes, in a DNA Engme PTG- 
200 (MJ Research Inc., V/atenov.n, M.- apparatus v.-ithout adding any mineral 



oil. 



^. The azr.pliiicadon resulted i.-. a large am.ount of PGR product w^th the 
correct size of Lhe subtiiisLn E whole gene. 



(5) Cloning 
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Since the short DNA fragments were generated wth five different DNA 
polymerases, there were five pools of final PGR amplified reassembled 
products. Each of the DNA pool was used for constructing the corresponding 
subtilisin E mutant library. 

1. The PGR amplified reassembled product was purified by Wizard DNA- 
GleanUp kit (Promega, Madison, WI), digested with Bam HI and me 1. 
electrophoresed in a 0.8% agarose gel. The 986-bp product was cut from the 
gel and purified by Wizard PGR Prep kit (Promega, Madison. WI). Products 
were ligated with vector generated by Bam Hl-Nde 1 digestion of the pBE3 
shutde vector. 

2. £. coli HBIOl competent cells were transformed with the above ligation 
mLrture to form a mutant library. About 4,000 transformants from this Ubrary 
were pooled, and recombinant plasmid mbcturc was isolated from this pool. 

3. B. sublilis D3428 compcreat cells were transformed vAih the above 
isolated plasmid mixture to form another Ubraiy of the subtilisin E variants. 

4. Based on the DNA pol>-merass used for random priming the short, 
nascent DNA fragments, the five libra-rics constructed here were named: 
librai^VKlenow, library/T4. Hbran'/Scqucaase. library/Stoffel and library/Pfu. 
About 400 tranformants from each library were randomly picked aj^d 
subjected to screening for thsrmostabilitj' (see Step (7)]. 

(5) Random clone sequencing 

Ten random clones from the 3. s'Sotilis DB423 Ubrarz/KJenow was 
chosen for DNA sequence anaJysis. RccombLnant plasmids were individually 
purified from 8. subdlis DB423 usLng a QUprcp spin plasmid mmiprcp kit 
(QUGEN) wi-Ji the modificauon th.at 2 mg/ml lysozyme was added to PI buffer 
and the cells were incubated for 5 minutes at 37'C. retransformed into 
competent E. coli H3 101 and then purified again using QlAprep spin plasmid 
miniprep kit to obtain sequencing qualiry DNA. Sequencing was done on an 
ABl 373 DNA Sequencing System using the Dye Terminator Gycle Sequencing 
kit (PerkJn-Elmer Gorp., Noru-al:-:, CT). 

(7) Screening for themostabdity 

About 400 transformar.ts from each of the five Ubraries described at 
Step (4) were subjected to screening. Screening was based on the assay 
described previously (33, 35), using succinyl-Ala-Ala-Pro-Phc-p-nitroanUidc 
(SEQ. ID. NO: 25) as substrate. B. subdlis DB428 containing Lhe plasmid 
Ubrary were grov^m on LB/kanamycln (20 pg/ml) plates. After 18 hours at 
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aye single colonies were picked into 96-well plates containing 100 pi 
SG/kanamycin medium per well. These plates were shaken and incubated at 

: aye for 24 hours to let the cells to grow to saturation. The cells were spun 
down, and the supematants were sampled for the thermostability assay. 
Three replica 96-well assay plates were duplicated for each growth plate, with 

- each well containing 10 ml of supernatant. The subtilisin activities were then 
measured by adding 100 ml of activity assay solution (0.2 mM succinyl-AJa- 
Ala-Pro-Phe-p-niu-oaniUde (SEQ. ID. NO: 25), 100 mM Tris-HCl, 10 mM CaCb, 
pH 8.0, 37 **C). Reaction velocities were measured at 405 nm over 1.0 min. in 
a ThermoMax microplate reader (Molecular Devices, Sunnyvale CA). Activity 
measured at room temperacure was used to calculate the fraction of active 
clones (clones with activity less than 10% of that of wild type were scored as 

■ inactive). Initial activity (Ai) was measured after incubating one assay plate at 
65*'C for 10 minutes by immediately adding 100 pi of prcwarmed (37*C) assay 
solution (0.2mM succinyl-.\Ja-.^a-Pro-Phe-p-nitroaniHdc (SEQ. ID. NO: 25), 
100 mM Tris-HCl, pH 8.0, 10 mM CaCb) into each well. Residual actinty (.Ar) 
was measured after 40 m.Laute incubation. 

(8) Sequence Analijsis 

After screcnLng. one clone that showed Lhe highest thermostability 
v/iihin the 400 iransformar.ts fron: the Hbra.'-y/Kienov/ was re-strealied on 
L3/kanamycin agar plate, and sLngle colonics derived from this plate were 
inocularcd into tube cuirurcs. I'or glycerol stock and plasmid preparation. The 
recombinant plasmid was purii'cd using a QL-Aorep spin plasmid miniprcp kit 
(QIAGEN) v;iLh the modiilcation that 2 mg/m.l lysor\*me was added to PI buffer 
and the cells were incubated for 5 minutes at 37°C, rctransformcd into 
competent £. co/t HB 101 and then purified again using QUprep spin plasmid 
miniprep Vdt to obtaLn sequencing qucdity DNA. Sequencing was done on an 
ABI 373 DNA Sequencing System using the Dye Term.Lnator Cycle Sequencing 
kit (Perkin-Elmer Corp.. Non%'ai:-:. CT). 

Results 

I. Recombination frequency ar.d efficiency associated with the random- 
sequence recombination. 

The random pruned process was carried out as described above. The 
process is illusu-ated in FIG. 1. Ten clones from Lhe mutant library/Klenow 
v/ere selected at random and sequenced. As summauized Ln FIG. 12 and Table 
5, ail clones were different from the parent genes. The frequency of occurrence 
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of a particular point mutation from parent Rl or R2 in the recombined genes 
ranged from 40% to 70%, fluctuating around the expected value of 50%. This 
indicates that the two parent genes have been nearly randomly recombined 
with the random primer technique, FIG. 12 also shows that all ten mutations 
can be recombined or dissected, even those that arc only 12 bp apart. 

We then estimated the rates of subtilisin thcrmoinactivation at 65*C by 
analyzing the 400 random clones from each of the five libraries constructed at 
Step (5). The thermostabilities obtained from one 96-weIl plate arc shown in 
FIG. 13, plotted in descendLng order. Approximately 21% of the clones 
exhibited thermostability comparable to the mutant with the N181D and 
N218S double mutations. This indicates that the N181D mutation from RC2 
and the N218S mutation from RCl have been randomly recombined. 
Sequence analysis of the clone exhibiting the highest thermostability among 
the screened 400 transformants from the library/ Klenow showed the mutation 
N161D and N218S did exist. 

2. Frequency of newly introducjc r-utations during the random primLng 
process 

Appro.Kimatcly 400 L-ansforrr.ants from each of the five 3.sublilis 
D3428 libraries (sec Step (5)! were picked, grov.-n in SG medium 
supplemented with 20 ug/"l kar.amycin in 96-well plates and subjected to 
sub'JUsin E activity screening, .-^.p proximately 77-84% of the clones expressed 
acLivc enz>Taes, while l6-23Vo of ihc transformants v/ere inactive, presumably 
as a result of nev.'ly ir.L-c:iuccc rr.utations. From prc-ious experience, we 
kno^' that this rate of inacuvaoon indicates a mutation rate on the order of 1 
to 2 mutations per gene (35). 

As shown in FIG. 12, 18 nev/ point mutations were introduced in the 
process. This error rate of 0.18% corresponds to 1-2 new point mutations per 
gene, which is a rate that has been determined from the inactivation cur^'e. 
Mutations are nearly randon-.ly dislnbuted along the gene. 
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TABLE 5 



DNA and amino acid residue substitutions in the ten 
clones from Library/Klenow 



random 



Base 



Substitution 



Amino Acid 
SubstiniDon 



Substitution 

TVTJC 



CHI 


839 


A-+C 


transvcrsion 


GIy->Gly 


synonymous 


CH7 


722 


A-*G 


transition 


Scr-+Scr 


5ynon>'mous 


Ctt2 


902 


T-*C 


transition 


Val-^VaJ 


synon>*mous 


C#2 


1117 


C->G 


transvcrsion 


Scr-+Scr 


synor.>Tnous 


Cn4 


809 


T-^C 


transition 


Asn-*Asn 


synon>'Tnous 


C#4 


1093 


G-^C 


transvcrsion 


Gly-»Ala 


n 0 n - s>T. 0 n ym 0 u s 


C#4 


1 102 


T-^C 


transition 


Ala-+Ala 


synonvTnous 


Ci*6 


653 


C-tA 


transvcrsion 




non-r.T.on^Tnou s 


cne 


654 


A-»T 


transvcrsion 


His-tllc 


non-s;.Tion>Taous 


Cw6 


657 


T->C 


Lransition 


Val-4AJa 


non-s>'nonyTnous 


C«6 


653 


A-*C 


transvcrsion 


Val-+AU 


non-s>T'.on>Tnous 


C«6 


I 144 


A-^G 


Iran si lion 


Aia-*Ala 


synon>Tnous 


C«6 


I 147 


A-»G 


Lransi'Jon 


Ala-+ALa 


synon>Tnous 


C^7 


478 


T->C 


Lrar.sitJon 


Ilc-*Ilc 


3yr.or.>Tnous 


C«9 


731 


A-+C 


L-ar.si'Jon 


Ala-+Ala 


s>Tsor.>Tnous 


C«9 


994 


A-+G 


L-ansition 


Val-*V;iI 


syncn>-mo'Js 


C* 10 


11 11 


A-G 


L-'ansi'Jon 


Gly-Gly 


s>-non>'Tno'JS 


C**10 


1 I 12 


A-^T 


irar.svcrsion 


Thr-^S-r 


non-S;-r.on;,-mo*j s 



The rr^/Jiauon n^es are listed in TABLE 5. The direcdon of muiauon is 
clearly nonrar.dorr.. Fc: cxa.-T.plc, A changes mere often to G than to either T 
or C. .M\ ti-ansllions, ar.d in particular T-C and A-G, occur more often than 
transvcrsion. Some nucleotides are more mutable than others. One G-^C. one 
C-^G and one C->A tran.sversions were found wiihLn the 10 sequenced clones. 
These mutations were generated very rarely during the error-prone PGR 
mutagenesis of subtiUsin (37). Random.-priming process may allow access to a 
greater range of ammo acid substitutions than PCR-based point mutagenesis. 

It is interesting to note that a short stretch of 5' C GGT ACG CAT GTA 
GCC GGT ACG 3* (SEQ. ID. NO: 16) at the position 646-667 in parents Rl and 
. was mutated to o C GGT .^CG .-.TT GCC GCC GGT ACG 3' (SEQ. ID. NO: 
17) in random clone C = 6. Since tr.c stretch contains wo short repeats at th.e 
both ends, thie newly inu-oduced mutations may result from a splipped-strand 
mispairing process instead of pomt-mutation only process. Since there is no 
frame-shift, this kind of slippage may be useful for domain conversion. 
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3. Comparison of different DNA polymerase fidelity in the rand cm -priming 
process 

During random-priming recombination, homologous DNA sequences 
are nearly randomly rccombined and new point mutations are also 
introduced. Though these point rauUtions may provide useful diversity for 
some in vitro evolution applications, they arc problematic recombination of 
beneficial mutations already identified previously, especially when the 
mutation rate is this high, Conu-olling error rate during random ■ priming 
process is particularly important for successfully applying this technique to 
solve in uiiro evolution problems. By choosing different DNA polymerase and 
modifying the reaction conditions, the random priming molecular brcedmg 
technique can be adjusted to generate mutant libraries w-ith different error 
rates. 

The Klenow fragment of £.cofi DNA polymerase 1, bacteriophage T4 DNA 
polymerase, T7 sequcnasc version 2,0 DNA polymerase, the Stoffcl fragment of 
Taq polymerase and Pfu pol>Tr.crasc have been tested for the nascent DN,A 
fragment s>Tithcsis. The acti'.-ity profiies of the resulting five populations (sec 
Step (5)1 are shown in FIG. 13. To generate these profiles, activities of the 
individual clones measured in the 95-weil plate screening assay are ploncd in 
descending order. The Library/ See fi'ei and Library/ Kie now contain higher 
percentage of wild-TrTie or inactive subtiiisin E clones than that of the 
Library/Pfu. la all five popula'^ons. percentage of the wild-type and inactive 
clones ranges from 17-30%. 

EXAMPLE 8 

Use of defined Hankiag primers and staggered extension 
to recombiae single stranded DNA 

This exELmple demonstrates the use of the defined primer recombi- 
nation with staggered extension in the recombination of sLngle stranded DNA. 

Method Description 

SLngle-stranded DNA can be prepared by a variet>- of m.ethods, most 
easily from plasmids using helper phage. N^any vectors in current use arc 
derived from filaimentous phages, such as M13mp derivatives. .After 
transformation into ceils, these vectors can give rise both to a new double- 
stranded circles and to a single-stranded circles derived from one of the tv^'o 
strands of the vector. SLngle-stranded cL^cles are packaged into phage 
narticies, secreted from cells and can be easily purified from the culture 
suoematant. 
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Two defined primers (for example, hybridizing to a' and 3' ends of the 
templates) are used here to recombine single stranded genes. Only one of the 
primers is needed before the fmal PGR amplification. Extended recombination 
primers are first generated by the staggered extension process (StEP), which 
consists of repeating cycles of denaturation foUowed by extremely abbreviated 
annealing/extension step(s). The extended fragments are then reassembled 
into full-length genes by thermocycling-assisted homologous gene assembly Ln 
the presence of a DNA polymerase, followed by a gene amplification step. 

The progress of the staggered extension process is monitored by 
removing aliquots (10 ul) from the reaction tube (100 ul starting volume) at 
various time points in the primer extension and separating DNA fragments by 
agarose gel electrophoresis. Evidence of effective primer extension is seen as 
appearance of a low molecular weight 'smear' early in the process which 
increases in molecular weight v-iLh increasing cycle number, initial reaction 
conditions are set to allow template denaturation (for example, 94'C-30 
second denaturation) followed by very brief annealing/ extension step(s] (e.g. 
55'C-l to 15 seconds) repeated L-.rough 5-20 c>-cle increments prior to 
reaction sampling. Typically. 20-200 cycles of staggered e.xtension are 
required to generate single strar.ded DNA 'smears' corrcspondLng to si2es 
greater than the length of the complete gene. 

The e.xp=rirr.cnt.a! design is as in Example 1. Two thermostable 
subliUsin E mutar.ts Rl and Kl' gcr.c arc subcioncc into vector MlSrr.r^lS bv 
rcstric-Jon digcsuon with EcoRi and 3amHi. Single stranded DNA is prepared 
as described (39). 

Two flanking primer based reco~birj2iion 

T^vo defined primers, ?5N (5 -CCG.AG CGTTG C.AT.AT GTGGA AG-S' 
(SEQ. ID. NO: 18), underlined sequence is Ndel restriction site) and P3B (5 - 
CG.ACT CTAGA GG.^TC C GATT 0-3' (SEQ. ID. NO; 19), underlined sequence is 
BamHI restriction site), corresponding to 5' and 3' flanking primers, 
respectively, are used for rccombL-.auon. Conditions (100 ul final volume): 
0.15 pmol single-stranded DNA con:a.nLr.g Rl and R2 gene (mixed at 1:1) are 
used as template. 15 pmol of one I'anV-ing primer (either P5N or F33), Ix Taq 
■buffer, 0.2 mM of each dNTP, 1.5 mM MgCh and 0.25 U Taq polymerase. 
Program: 5 minutes of 95=C, 80-200 cycles of 30 seconds at 94-C, 5 seconds 
at 55°C. The single-stranded DNA products of correct size (appro.-dmatcly 
Ikb) are cut from 0.8% agarose gei after electrophoresis and purified using 
QLAEX 1! gel extraction kit. This puriiled product is amplified by a 
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conventional PGR. Condition (100 ul final volume): 1-10 ng of template, 30 
pmol of each flanking primer, Ix Tag buffer, 0.2 mM of each dNTP, 1.5 mM 
MgCb and 0.25 U Tag polymerase. Program: 5 minutes at 95*0, 20 cycles of 
30 seconds at 94'C, 30 seconds at 55'C, 1 minute at 72»C. The PGR product 
is purified, digested with Ndcl and BamHI and subcloned into pBE3 shuttle 
vector. This gene library is amplified in £. coli HBlOl and transferred into B. 
subtilis DB428 competent cells for expression and screening, as described 
elsewhere (35). Thermostability of enzyme variants is determined in the 96- 
well plate format described previously (33). 

This protocol results in the generation of novel sequences containing 
novel combinations of mutations from the parental sequences as well as novel 
point mutations. Screening allows the identification of enzyme variants that 
are more thermostable than the parent enzymes, as in E.xamplc 1. 

As is apparent from the above e.xamples, primer-based recombination 
may be used to e.xplore the vast space of potentially useful catalysts for their ' 
opdmal performance in a wide range of applications as well as to develop or 
evolve new enzymes for basic s-njcrjre-function studies. 

V/hile the present spsciHcation describes using DNA-dcpendent DNA 
poK-TT.erase and single-stranded DNA as templates, alternative protocols are 
also feasible for usL-.g sLngle-str ar.dcd RNA as a template. By using specific 
protein mRNA as the template ar.d RNA-dc?cndcnt DNA pol>Tnerasc (reverse 
transcriptase) as the catalyst, the methods described herein may be modified 
to introduce mutations a.-.d crossovers into cDNA clones and to create 
mo!:cular divcrsir/ di.'sctly :-rom the m.RNA level to achieve the goal of 
optimizing protein functions. This would greatly simplify the ETS (expression- 
lagged strategy) for novel catalyst discovery. 

In addition to the above. Lhe present invention is also useful to probe 
proteins from obligate intracellular pathogens or other systems where cells of 
interest cannot be propagated (38). 

Having thus described e.xcrr.plajy embodiments of the present inven- 
tion, It should be noted by tr.osc sVdlicd in the art that the ^vithln disclosures 
are exemplary only and that va.-:o'Js other alternatives, adaptations, and 
modifications may be mace v.-J-^.;.-^. the scope of th.e present invention. 
Accordingly, the present Lnvcntior. is not Umited to th.e specific embodiments 
as Ulustratcd herein, but is only U-itcd by the follovnng claims. 
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SEQUHNCZ LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICWrrS: Frances H. Arnold 

Zhixin Shao 
Joseph A. Affholcer 
Huimin Zhao 
Lori Giver 

(ii) TITLE OF INVENTION: RecorrOai nation of Polynucleotide 
Sequences Using Defined or Random Primer Sequences 

(iii) NUMBER OF SEQUENCES: 25 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Oppenheimer Poms Smith 

(3) STREET: 2029 Century Park East, Suite 3300 

(C) CITY: Los Angeles 

(D) STATE: CA 

(El COUNTRY: USA 
(F) ZIP: 90067 

(v) COMPUTER HEADAaLE FO?J< : 

(A) MEDIU>! TYPE: Floppy dis:-: 
(3) COMPUTER: IBM PC compatible 

(C) 0PEPJ1.TING SYSTEM: Wir.dovs 

(D) SOFTWARE: Microsoft Word 6.0 

(vi) CJR-REJJT APPLICATIO:: DATA: 
(A) APPLICATIO:: jrj>'.3E?.: 
(3) FILIN'G DATE : 
(C) CLASSiriCATXON': 

(vii) ??.IOR APPLICATION' DATA: 

(A) application: J"J:-:3E?-: 60/OA1,656 
(3) FILI::G DATE: March 25, 199"? 

(C) APPLICATIO:: -JSA^Ztr, 60/0n5,211 

(D) FILING DATE: April 30, 1997 

(E) APPLICATIO:: rrj>:3E?.: 60/0-^S.256 

(F) FILI::G date: May 12, 1997 

(viii) ATTOPJIEY/AGE.^rr INFOPJ-^^TIOS : 

(A) NA.ME: Oldenkamp, David J. 

(3) REGIST?-ATIO;i \T'S:\3Z?r. 2 9 , -i 2 1 

(C) REFEREMCE/D0C-:3T >.Tj->'3E?. : 330187 -8-i 

(ix) TELECOMJ-OJ^IICATION' IN'FOPJ'-ATIOS' : 
(A) TELEPHONrE: (310) 725-5000 
(3) TELEFA:<: (3101 277 -12 57 

'(2) nirOPJ-lATlON' FOR SEO ID ::0 : 1: 

(il SEQUENCE CKAJ^^CTERISTICS : 
(A) LENGTH: 22 nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOGY: linear 



(ii) MOLECULE TYPE 



oligor.ucieotide 
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(xi) SEQUENCE DESCRIPTIO.V: SEQ ID NO: 1 
CCG AGC GTT GCA TAT GTG GAA G 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
CGA CTC TAG AGG ATC CGA TTC 



(2) INFORMATION FOR SEQ ID NO: 3: 

(11 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 nucleosides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) KOLECULE TYPE: oligonucleotide 
(vi) SEQUENCE DESCRI?TIO^: : SEQ ID S'0 : 3 
GAG CAC ATC AGA TC i /^ * i 



(2) INT0R:-'ATI0N' for S£0 10 ::0: 

( i ) S EQl^E^:C =: C rS^S^ZTZ ?. I 5 T I C S : 
(A) LENGTH: 2 1 r.ucleotides 
(Bl TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUE^^CE DESCRIPTION: SEQ ID NO: 

GGA GTG GCT CAC AGT CGG TGG 



(21 IN'FOPJ-'ATION' FOR SEQ ID NO: 5; 

(i) SEQUENCE CHA-RACTERISTIC3 : 

(A) LENGTH: 21 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) S£QUZ::CZ DESCRIPTION: SEQ ID N'O : 
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TTG AAC TAT CGG CTG GGG CGG 

(2) IHFORMATIOH FOR SEQ ID KO: 6: 

(i) SEQUrr/CE CHARACTERISTICS: 

(A) LENGTH: 21 nucleotides 

(B) TYPE: nucleocide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ol igonucleot: ide 
(xi) SEQUE^rCE DESCRIPTION: SEQ ID NO: 
TTA CTA GGG AAG CCG CTG GCA 

(2) IN-FORMATIO:^ FOR SEQ ID ;:0 : 7: 

(i) SEQUEN'CE CHARACTERISTICS: 
(A) LENGTH: 2 1 nucleotides 
(3) T*r?E: nucleotide 

(C) TOPOLOGY: linear 

(iil MOLECULE TYPE: ol igonucl eo: ide 
(xi) SEOUEN'CE DESCRIPTION.*: St^ ZD N'O : 
TCA GAG ATT ACG ATC G AA A^-w 

(2) i::ro?-;-iAT:o:: .-o?. seq id j:o; B: 

(A) LE::GTH: 21 nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLCGV: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(;-;i) SEQUENCE DESCRIPTION:: SEQ ID NO: 

GGA TTG TAT CGT GTG AGA AAG 

(2) IN'FOPJ-IATIOM FOR SEQ ID N'0 : 9; 

(i) SEQUENCE CH.A?-ACTERIST:CS : 
(A) LENGTH: 21 nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOG:: linear 

(ii) MOLECULE TYPE: ol igonuc lee t ide 
(xi) SEQUr::CE DESCRIPTIC:.*: SEQ ID 

A-AT GCC GGA AGC AGC C«wC 1 
(2) INFOP-^LATION FOR SEQ ID NO: 10: 
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(i) SEQUZNCE CHARACTERISTICS: 

(A) LENGTH: 21 nucleocides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
CAC GAC AGG AAG ATT TTG ACT 



21 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleocides _ • 
(3) TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID V<0\ 11 

ACT TAA TCT AGA GGC TAT TA 



20 



(2) I^^:OR:<ATION FOR SEQ ID N'0 : 12: 

(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 2 0 nucleo:i-es 
(3) TYPE: nucleoside 
(C) TOPOLOGY: linear 

(ii) rOLECULE TYPE: o 1 i ccn-C 1 G i d- 

(xi) SEQUE^:cE de:scr:?tic::: seq id no: 12 : 

ACC CTC GCG GGA TCC CCG GG 



20 



(2) INFOPJ-'ATIO:: FOR SEQ ID NO: 13: 

(i) seque;:ce characteristics: 

(A) LE::GTH: 2S nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oi igonucl ec t ide 

(xi) SEQUENCE DESCRIPTXO::: SE? ID NO: 13 

GGT AGA GCG ACT CTC GAG GGG GAG ATG C 



23 



(2) INFOPa-tATIOri "OR SEQ ID NO: 1 : 

(i) SEQUENCE CHAR-ACTERISTICS: 
(A) LENGTH: 22 nucleocides 



(21 ir.TOPyj.'TlOr. FOR SZQ ID 



(A) LENGTH: 2 2 n'-cleoCices 
(31 TYPE : r.u"leo^ide 
(C) TOPOLOGi : lir.ear 



22 



WO 98/42832 PCT/US98/05956 

.57 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUENCE DESCRIPTIOH: SEQ ID NO: 14: 
AGO CGG CGT GAC GTG GGT CAG C 2 2 

(2) INF0R.MATION FOR SEQ ID NO: 15: 

(i) SEQUENCE C?L?JIACTE?.ISTICS: 

(A) LENGTH: 22 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ol igor.uc leo t ide 
(xi) SZO'JZy^CZ DESCRIPTION: SEQ ID NO: IS: 
CCG AGC GTT GCA TAT GTG GAA G 



(2) INEOP-^IATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CKAPJ^CTERISTICS : 
(A) LENGTH: 21 nucleo-ides 
(3) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) ;-:OLECULE TYPE: ol ico" -cleot idc 
(xi) SEO'JENCE DESCRI ?TIO:: : SEQ ID N'O : 15: 
CGA CTC TA3 AGG A;^ C^/-v i . 

(2) IN'FOPJ'ATICN FOR SEQ ID N'0 : 17: 

(i) SEQ'JENCE CKAP-ACTERISTICS : 
{Al LENGTH: 22 nucleotides 
(31 TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) r.OLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION*: SEQ ID NO: 17 
CGG TAC GCA TGT AGC CGG TAC G 



22 
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(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUEMCE DESCRIPTION: SEQ ID NO: 16: 
CGG TAC GAT TGC CGC CGG TAG G 

(2) IKrORMATIOM FOR SEQ ID KO : 19: 

(i) SEQUEMCE CHARACTERISTICS: 

(A) . LENGTH: 22 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY; linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SZQUZy^CZ DESCRIPTION*: SEQ ID NO: 19: 
CCG AGC G7T GCA TAT GTG GAA C 



(2) IMTORMATION' FOR SEQ ID S*0 : 20: 

(i) SEQUE;:CE CHARACTERISTICS : 

(A) LENGTH: 21 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUENCE DESCRIPTION': SEQ ID NO: 20: 
CCA CTC TAG AGG AiC C^^*^ 4 



(2} INrOR/iATION 70?. SEQ 13 N'O : 21: 



[i) SEQUE^:CE Cr:AJ=-ACTERI3TICS : 
(A) LENGTH: 13 nucleotides 
(3) TVPE: nucleotide 
(C) TOPOLOGY: .linear 



(ii) MOLECULE TYPE: oligonucleotide 



(xi) SEQUENCE DESCRI ?TIC^: : SEQ 



ZD 



21 : 



GGC GGA GET 



18 



(2) INTOPJ-IATICN 



rOR SEQ ID N'O : 2 2: 



(i) SEQUENCE CHARACTERISTICS; 
(A) LE.NGTH: 13 nucleotides 
(3) TYPE: nucleotide 
(C) TOPOLOGi: linear 



(ii) XOLECUL 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
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GAT GTG ATG GCT CCT GGC 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 nucleotides 

(B) TYPE: nucleotide 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(xi) SEQUENCE DESCRIPTIOy/ : SEQ ID NO: 23: 
CAG AAC ACC GAT TGA GTT 



(21 INFOR^-tATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 nucleotides 
(Bl TYPE: nucleotide 
(C) TOPOLOGY: linear 

(ii) KOLECULE TYPE: oligonucleotide 

(xi) SEQUE>:CE DESCRIPTION: SEQ ID I/O: 2-;: 

ACT GCT TTC TAA ACG ATC 



(2) i;:fo?j-iatio:i for seq id ;;o: 25; 

(i) SEQuEN'CE CrlAR^CTZRISTICS : 
K^k] LENGTH *; a-ir.o acids 
(3) T".''?E: peptide 
(C) TOPOLCG:: linear 

(ii) KOLECULE TYPHI: pepiide 

(xi) SEQUEN'CE DESCRIPTION': SEQ ID NO: 25: 
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CLAIMS 



What is claimed is : 

A method for making double-stranded mutagcnized 
polynucleotides from at least one template polynucleotide wherein said 
mutagcnized polynucleotides has at least one nucleotide which is difTerent 
from the nucleotide at the same position in said template polynucleotide, said 
method comprising: 

a) conducting crL2yme- catalyzed DNA polymerization synthesis from 
random-sequence or defined-sequence primers in the presence of said 
template polynucleotide to form a DNA pool which comprises short 
polynucleotide fragments and said template poUmucleotidets); 

b) denaturing said DNA pool into a pool of singlc-strsLndcd fragments^ 

c) • allowing said single-stranded fragments to anneal, under annealLng 
conditions, to form a pool of anneaJcd fragments; 

d) incubating said pool of ar.ncaled fragments with polymerase under 
conditions which result in cxtensicr. of said double-stranded fragments to 
form a fragment pool comprising cxicndcd single-stranded fragments; 

e) rcpcadng steps b) Lhrc^gh c) until said fragment pool contams said 
mutagcnized polynucleotides. 

. - . • ; . . . i_ 1 . I - _J » : 1 I I-- 

udcs according to cla-rr. 1 v/hcrc':r. said s:nglc-s:randed fragments have areas 
of corr.olcmcntariry ar.d wherein said step of incubadng said pool of annealed 
fragments is conducted under conditions in which the short pol>Tiucleodde 
strands or c:':tcnded short po'/.-nucleodde strands of each of said annealed 
fragments prime each other to form said fragment pool. 

3. A method for m.aldng double-stranded mutagcnized pol>mucleo- 
tides according to claLm I wherein said step of incubating said pool of 
annealed fragm.cnts is conGuc:ed in the presence of said template 
pol>mucleodde(s) to provide r-vicrn repriming of said single-suanded 
poKmucleotides and said templa:-: uc!eoiide(s). 

4. A method for mai-iing double-siranded mutagenized poUmucleo- 
ddes according to claim 1 wherein at least one of said primers is a defined 
seouence orim.er. 
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5. A method for making double-stranded mutagemzed polynucleo- 
tides according to claim 2 wherein at least one of said primers is a defined 
sequence primer. 

6. A method for making double-stranded rautagenlzcd polynucleo- 
tides according to claim 3 wherrin at least one of said primers is a defined 
sequence primer. 

7. A method for making double-stranded mutagenized pol.vnucleo- 
tides according to claim 4 wherein said primer comprises from 6 to 100 
nucleotides. 

8. A method for making double-stranded mutagenized poK-nucleo- 
tides according to claim 5 xvherein said primer comprises from 6 to 100 
nucleotides. 

9. A method for mal-d.-.g do'jble-stranded mutagenized polj-nuclco- 
tides according to claim 6 wherci.-^. said primer comprises from 6 to 100 

nucleotides. 

10. A method for ma:--Jng do'Jblc-strar.ded mutagenized po!;,-nuclco- 
tldes according to claim ^ wherein a: least or.e dcfmed terminal primer is 

used. 

11. A method for rr.al-i^ng double-sirandcd mutagenized pohT^.uclco- 

rr. ri:.;- S \wh--c^n at Icast one defined terminal pnmer is 
ades accorcmg to claj... o vn-.c.ii icao. 

used. 

12. A method for making double- stranded mutagenized pohmucleo- 

r„ r-i-.;r-. fi v---"-'T a: least one defmed termLnal primer is 
uaes according to claim o ^ --i '-'^^ 

used. 

13. A method for mai-ur.g double-stranded mutagenized poKmuclco- 
' tides according to claim 1 whcreir. said primers are defmed sequence primers 

e.-chibiting limited randomness a: or.c or more nucleotide posidons ^^^thin the 

primer. 
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14. A method for making double-stranded mutagenized polynucleo- 
tides according to claim 13 wherein said primers comprise from 6 to 100 
nucleotides. 

15. A method for making double-stranded mutagenized polynucleo- 
tides according to claim 13 wherein two or more defmed primers specific for 
any region of the template are used. 

16. A method for making double-stranded mutagenized polynucleo- 
tides according to claim 1 wherein said primers are defined sequence primers 
exhibiting limited randomness at more than 30% of the nucleotide positions 
within the primer. 

17. A method for maldng double-stranded mutagenized pol>-nucleo- 
tides according to claim 16 wherein said primers comprise from 6 to 100 
nuclcoiides. 

IS. A method for making double-stranded mutagenized pol>Tiucleo- 
tides according to claim 16 wherein r^vo or more defined primers spccL^ic for 
any region of the template arc used. 

19. A method for mai-ung coublc-sL^andcd mutagenized pol>'nuclco- 
tides according to claim i v.-hcrcin said primers aje defined sequence prim.ers 
ounibiung limited rar.dcrr.r.-jss a: more thar. 60Vo of the nucleotide positions 
within the primer. 

20. A method for m.ai-ung double-stranded mutagenized polynucleo- 
tides according to claim 19 wherein said primers com.prisc from 6 to 100 
nucleotides. 

21. A mcLhod for mal-ung double-stranded mutagenized polynuclco- 
Udcs according to claLm 19 wherein r^vo or more defined primers specL^c for 
any regions of the tcm.pl a:e(si arc use-. 

22. A method for mai-ung double-stranded mutagenized pol>Tiucleo- 
tides according to claim 1 v/herein said primers are random-sequence 
crimpers . 
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23. A method for making double-stranded mutagenized polynucleo- 
tides according to claim 22 wherein the lengths of said primers are from 6 to 
24 nucleotides long. 

24. A method for making double-stranded mutagenized polynucleo- 
tides according to claim 22 wherein said template polynucleotide(s) arc 
removed from said DNA pool after generation of said short polynucleotide 
fragments. 

25. A method for mai-dng double-stranded mutagenized polynucleo- 
tides according to claim 1 which includes the additional steps of isolating said 
mutagenized double-stranded polynucleotides from said DNA pool and 
am.plif>'ing said mutagenized double-stranded polynucleotides. 

26. A method for making double-stranded mutagenized polynucleo- 
tides accordmg to claim 25 wherein said mutagenized double-siranded 
pol>Tiucleotides are amplified by the pol>'mcrase chain reaction. 

27. A method for procucLng an cnr,-me comprising the steps of: 

a) insertir.g ir.io a vecior a double-sU'anded mutagenized 
pol-.-nuclcotide made according to claim I to form an expression vector, said 
rr.uia^er.Lzcd oolvmuclcoLidc er.codLng ar. cnz}Tr.c; 

b) transforrr.ir.g a host cell with said expression vector; and 

c) expressing th.c cnz^^.-me encoded by said m.utagcnized 
poi^-nucleotide. 

2S. t\ process for preparing double-siranded mutagenized 
polynucleotides from at least one template polynucleotide, said mutagenized 
polynucleotides ha-nng at least one nucleotide which is different from the 
nucleotide at the corresponding position in said template pol>Tiucleotide, 
v/herein said process comprises: 

(a) performL^.g crjr™.c-cata]>'zcd DNA polymerization from 
random-sequence or defined-secucncc primers Ln the presence of said 

' template pol>TiucleoLide(s) to icrm. a DNA pool containing short polynucleotide 
fragments and said template pD'.>-nucieotide(s); 

(b) denanirmg said DN.A pool into a pool of both single-stranded 
fragm.ent pol>muclcotides and sLngle-stranded template poKmucleoddes; 
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(c) allowing the single-stranded polynucleotides of said pool to 
anneal, under annealing conditions, to form a pool of double-stranded 

annealed polynucleotides; 

(d) incubating said pool of annealed polynucleotides with DNA 
polymerase under conditions which result in extension of said double- 

, stranded polynucleotides . to form a DNA pool containing extended double- 
stranded polynucleotides; and 

(c) repeating steps (b) through (d) until said DNA pool 
containing extended double-stranded polynucleotides contains said 
mutagenized polynucleotides. 

29. The process according to claim 28 wherein said pool of singlc- 

• stranded fragment polynucleotides and single-stranded template polyr.ucleo- 

• tides contain sLngle-stranded fragment polynucleotides having regions 
complementary to regions of other single-stranded fragment polynucleotides in 
said pool such that these fragment po!>-nucl:otides anneal to each other in 
step (c), and prime each other in step (d). 

30. The process according to claim 28 wherein said single-stranded 
template pol>-nucleotide(s) anneal to at leas: some of the sLngle-stranded 
fragment pol:mucleolid=s, in step (c), so as to provide random re-priming of 
saiH .inglc-su-anded fragn^.ent pol:.^.uclco*Jd=3 in step (d). 

31. A process for preparing double-stranded mutagenized 
pob-nucleoddes from at least r.vo template pol>-nucleolides, said template 
polvnucleotides includL-.g a first template polynucleotide and a second 
"template pol>-nucl=otid= which differ from each other, said mutagenized 
pol>^ucl=otid=s having at least one nucleotide which is different from the 
nucleotide at the corresponding position in said first template pol>-nucleot.de 
and at least one other nucleotide which is different from that at the 
corrcspondung position in said second template polynucleotide, wherein said 

process comprises: 

(a) performing cnr-.me-cataJ\-zed DNA pohmerization either 

■ from a set of random-sequence pr~.=rs or from at least one denned-sequencc 
primer, upon said template pohT.udeotides under standard DNA pol>meriza- 
lion conditions or under conditions resulting in only partial extension, to form 
a DNA pool containing poli.-nucleotide fragments a.nd said template 
Dolvnucleotides; 
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(b) denaturing said DNA pool into a pool of both single-stranded 
fragment polynucleotides and single-stranded template polynucleotides; 

(c) allowing the single-stranded polynucleotides of said pool to 
anneal, under annealing conditions, to form a pool of double-stranded 

annealed polynucleotides; 

(d) incubating said pool of annealed polynucleotides with DNA 
polymerase under conditions which result in full or partial extension of said 
double-stranded polynucleotides to form a DNA pool containing extended 
double-stranded pol>Tiucleotides; and 

(e) repeating steps (b) through (d) until said DNA pool 
containing extended double-stranded , pol>'nuclco tides contains . said 
mutagcnizcd polynucleotides; 

provided that, when (1) standard DNA polymerization conditions are used in 
step (b) or (2) full extension is the result in step (d). if at least one defincd- 
sequence primer is used, at least one such primer must be a non-tcrmmal 
primer. 

32. The process according id claim 31 wherein said first template 
pol>Tiucleotidc differs from said second :em,p!ate pol>nucleotide in at least c^vo 
base pairs. 

33. The process according to claim. 32 wherein said r.vo base pairs 
arc separated from each other. 

34. The process according to claim 33 wherein said r.vo base pairs 
arc separated from, each o'dncr by at leasi abou: 15 base pairs., 
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